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The invention relates to a method for producing 
fatty acid esters which contain unsaturated fatty 
acids with at least three double-bonds, and free 
unsaturated fatty acids having a content of at least 1 
wt- %, in relation to the total amount of fatty acids 
contained in plants, by expression of at least one 
nucleic acid sequence which codes for a polypeptide 
having D-6-desaturase activity, and at least one 
nucleic acid sequence which codes for a polypeptide 
having D-6-elongase activity. Advantageously, said 
nucleic acid sequences can be expressed, optionally, 
together with a third nucleic acid sequence in the 
transgenic plant, which codes for a polypeptide having 
D-5-desaturase activity. The invention also relates to 
the use of defined nucleic acid sequences which code 
for polypeptides having a D-6-desaturase activity, D-6- 
elongase activity or D-5-desaturase activity, selected 
from a group of nucleic acid sequences, or to the use 
of nucleic acid structures containing the above- 
mentioned nucleic acid sequences. 



Bloayafc&aili chain 

Lical«ic acid 



T-Uooleaic *cjd 



Oibo»o-T-linoX«iHc acid 



Acocbidonlc acid 



AS-Deioturam 



A6-Blongase 



A5-OM«tOr»i8 



Liooluaic acid 



StimrlrlunJc «oi«l 



Elco«*t*tra«»oie *cid 



1*1 



Office de la Propriete 

Intellectuelle 

du Canada 

Un organisme 
d'lndustrie Canada 



Canadian 

Intellectual Property 
Office 

An agency of 
Industry Canada 



CA 2485060 A1 2003/1^/13 * 

pi, 2 485 060 

02)DEMANDE DE BREVET CANADIEN 
CANADIAN PATENT APPLICATION 

(13) A1 



(86) Date de depdt PCT/PCT Filing Date: 2003/04/25 

(87) Date publication PCT/PCT Publication Date: 2003/11/13 

(85) Entree phase nationale/National Entry. 2004/10/28 

(86) N° demande PCT/PCT Application No.: EP 2003/004297 

(87) N° publication PCT/PCT Publication No.: 2003/093482 
(30) Priorite/Priority: 2002/04/29 (102 19 203.0) DE 



(51) Cl int 7/lntCI. 7 C12N 15/82, C12P 7/64 

(71) Demandeur/Applicant: 

BASF PLANT SCIENCE GMBH, DE 

(72) Inventeurs/lnventors: 
CIRPUS, PETRA, DE; 
RENZ, ANDREAS, DE; 
LERCHL. JENS, SE; 
KUIJPERS, ANNE-MARIE, DE 

(74) Agent: ROBIC 



Biosynthesis chain 



l,inoleic acid 



-Linolenic acid 



Linolenic acid 



XOOH 



A6-Desaturase 



Stearidonic acid 



"CCOH 



^COOH 



V COOH 



Dihomo-Y-l inolenic acid 



Arachidonic acid 



A6-Elongase 

"COOH 

A5-Besaturase 

^COOH 



Eicosatetraenoic acid 



Eicosapentaenoic acid 



V COOH 



^COOH 



(57) Abrege/Abstract: hi h contain unsaturated fatty acids with at least three double- 

The invention relates to a method for producing to the total amount of fatty acids contained 

bonds and free unsaturated fatty acids having a content of at ^ast 1 wt. /o n r ^ . D . 6 -desaturase activity, and at 

" plants, by expression of at least one nucl^ AoVantageous*. sale J nucleic ac,d 

least one nucleic acid sequence which codes for a ' P * 1 *^^ sequence in the transgenic plant, which codes for a 

sequences can be expressed, optionally to 9f^^ use of defined nucleic acid sequences which code for 

polypeptide having D-5-de* aturase ac * vl * or D-5-desaturase activity, selected from a group of nucleic 

£3JS^ above-mentioned nucleic acid sequence, 



Canada 1 



http://opic.gc.ca 



■ Ottawa-Hull Kl A 0C9 - http://cipo.gc.cB 

OPIC-CIPO 191 



O PI c 




CIPO 



CA 02485060 2004-10-2B 



(12) NACH DEM VERTRAG UBER DIE INTERNATIONALE ZUSAMMEN ARBEIT AUF DEM GEBIET DES 
PATENT W ESENS (FCT) VEROFFENTL1CHTE INTERNATIONALE ANMEJLDUNG 



(19) Wcltorganisation fur gcistigcs Eigcntum 
Internationales Btiro 

(43) Interna lionales VeroJTentlichungsdaluin 
13. November 2003 (13.11.2003) 




PCX 



(10) Internationale VerofTentlichungsnummer 

WO 2003/093482 A3 



(51) Internationale PatenlklassiDkation 7 : C12N 15/82, 

C12P 7/64 



(21) Internationales Aktenzelchen: 



l > CT/El > 2003/004297 



(22) Internationales Anmeldedatum: 

25. April 2003 (25.04.2003) 



(25) Einreichungssprache: 

(26) Verofientlichungssprache: 



Deutsch 
Deutsch 



(30) Angaben zur Prioritat: 

102 19 203.0 29. April 2002 (29.04.2002) DE 

(71) Anmelder (ju r alle Bestirnrn ungsstaaten mit Ausnahrne von 
US): BASF PLANT SCIENCE GMBH [DE/DE]; Carl 
Bosch-Str. 38, 67056 Ludwigshafen (DE). 



(72) ErOnder; und 

(75) Erflnder/Anmelder (nur fUr US)i CIRPUS, Petra 
IDE/DFJ; Landteilstr.12, 68163 Mannheim (DE). KENZ, 
Andreas IDE/DEJ; lleiiuich-von-KJeist-Str.6, 67117 Lim- 
burgerhof (DE)- LERCHI., Jens [DE/SK]; Onsjovagen 
17, S-26831 Svalov (SE). KUIJPERS, Anne-Marie 
[NUDE], Pappelstr. 31, 67166 Otterstadt (DE). 

(74) Anwall: PRESSLER, Uwe; c/o BASF Aktiengc- 
sellschaft, 67056 Ludwigshafen (DE). 

(81) Bestimmungsstaaten (national): AE, AG, AL, AM, AT, 
AU, AZ, BA, BB, BO, BR, BY, BZ, CA, CH, CN, CO, CR, 
CU 5 CZ, DE, DK. DM, DZ, EC, EE, ES, FI, GB, GD, GE, 
GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, 
KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, 
MN, MW, MX, MZ, NI, NO, NZ, OM, PH, PL, PT, RO, 

[Fortsetzttng auf tier ruichsien Seite] 



(54) Title: METHOD FOR PRODUCING MULTIPLE UNSATURATED FATTY ACIDS IN PLANTS 

(54) Bezelchnung: VHRFAHREN ZUR HBRSTELLUNG MEHRFACH U NGES A1T1GTER FEITS A U REN _1N PFLANZEN 



BB 



i.!nol»*sKur« 



t CC 



GtcAr i danoaure 



♦ DO 



Bicosatetreensa-jre 



E ioop«nc atnluz • 



CC 
DD 



BIOS YNTI CSS CHAIN 
LHVOLEIC AC tO 
H-INOLEfC ACID . 
UIHOVK> r -LINOLEIC ACJO 
ARACHIDONIC ACJD 



< 
oo 

f*5 



JJ 

KK 
LL 



UNOLEICAr.lO 
STEARIDfC ACID 
EICOSATETKAENOIC ACID 
Ef CO PE.MTAENOJ C ACID 



(57) Abstract: The invention relates to a method for producing fatty acid esters which contain unsaturated fatty acids with at least 
three double-bonds, and free unsaturated fatty acids having a content of at least 1 wt.- %, in relation to the total amount of fatty 
acids contained in plants, by expression of at least one nucleic acid sequence which codes for a polypeptide having D-6-dcsaturasc 
activity, and at least one nucleic acid sequence which codes for a polypeptide having D-6-clongasc activity. Advantageously, said 
nucleic acid sequences can be expressed, optionally, together with a third nucleic acid sequence in the transgenic plant, which codes 
for a polypeptide having D-5-desaturase activity, t he invention also relates to the use of defined nucleic acid sequences which code 
for polypeptides having a D-6-desaturase activity, D-6-elongase activity or D-5-desaturase activity, selected from a group of nucleic 
acid sequences, or to the use of nucleic acid structures containing the above-mentioned nucleic acid sequences. 



(57) Zusamraeafassung: Die vorliegendc Erfindung betriCTtein Verfahrcn zur Herstel lung von FcUsaurecstcr, die ungcsaltiglc Fclt- 
sSurcn mit mindestens drci Doppclbindungcn cnthaltcn, sowic frcicn ungcsSttigtcn Fcttsaurcn mit cincm Gchalt von mindestens 1 
O Gew.-% bezogen auf die gesamten in der Pflanzen enthaltenden FettsMuren 

^ / to rtsetzung auf der n&chsten Seite J 



CA 02485060 2004-10-28 



WO 2003/093482 A3 IIIIII1HIIIIIIIIIIIIIIIIIIIIIIIIII1IIIIIII1IIIIIIIIII1 



RU, SC, SD, SE, SG, SK, SL, TJ, TM, TN, TR, TT. TZ, 
UA, UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Bestimmungsstaaten (regional): ARIPO Patent (GH, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
eurasisches Patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, 
TM), europiiisches Patent (AT, BE, BG, CH, CY, CZ, DE, 
DK, EE, ES, FT, FR, GB, GR, HU, IE, IT, LU, MC, NL, 
PT, RO, SE, SI, SK, TR), OAPI Patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TO, TG). 

Veroffentlicht: 

— mil internationalem Recherchenbericht 



— vor Ablauf der fur Anderungen der Anspruche geltenden 
Frist; Veroffenilichung wird wiederholl, falls Anderungen 
eintreffen 

(88) VcrofTcntJichungsdatuiH dcs intern at ion a Jen 

Rcchcrchcnbcrichts: 4. November 2004 

Zur Erkldrung der Zweibuchstaben-Codes und der anderen Ab- 
kurzwigen wird auf die Erklarungen ("Guidance Notes on Co- 
des and Abbreviations") am Anfangjeder reguldren Ausgabe der 
PCT -Gazette verwiesen. 



durch Expression mindestens cincr NukJcinsaurcscqucnz, die fur cin Polypcptid rait D-6-Dcsaturascaktivitat codiert, und mindestens 
einer Nukleinsauresequenz, die fiirein Polypeptid mit D-6-Elongaseaktivitat codiert. Vorteilhaft kflnnen diese Nukleinsauresequen- 
zen gegebenertfalls zusammen mit einer dritten Nukleinsauresequenz, die fur ein Polypeptid mit D-5-DesaturaseaJctivitat codiert, 
in der transgenen Pflanze exprimiert werden.Die Erfindung betrifft weiterbin die Verwendung defin letter Nukleinsauresequenzen, 
die fur Polypeptide mit einer D-6-Desaturaseaktivitat, D-6-Elongaseaktivitat oder D-5-Desaturaseakti vital codieren ausgewablt aus 
einer Gruppe von Nukleinsauresequenzen bzw. die Verwendung von Nukleinsaurekonstrukten en thai ten die vorgenannten Nuklein- 
sauresequenzen. 



CA 02485060 2004-10-28 



1 

METHOD FOR PRODUCING MULTIPLE UNSATURATED FATTY ACIDS IN 

PLANTS 

Description 

The present invention relates to a method for the production of 
fatty acid esters which comprise unsaturated fatty acids with at 
Least three double bonds, and to free unsaturated fatty acids 
with a content of at least 1% by weight based on the total fatty 
acids present in the plants, by expressing at least one nucleic 
acid sequence which encodes a polypeptide with A6-desaturase 
activity and at least one nucleic acid sequence which encodes a 
polypeptide with A6-elongase activity. Advantageously, these 
10 nucleic acid sequences can, if appropriate, be expressed in the 
transgenic plant together with a third nucleic acid sequence 
which encodes a polypeptide with A5-desaturase activity . 

The invention furthermore relates to the use of defined nucleic 
acid sequences which encode polypeptides with a A6-desaturase 
activity, A6-elongase activity or A5-desaturase activity selected 
from a group of nucleic acid sequences, and/or to the use of 
nucleic acid constructs comprising, the abovementioned nucleic 
acid sequences. 

Certain products and by-products of naturally occurring metabolic 
processes in microbial cells or in the cells of animals and, 
advantageously plants, have utility for a wide range of 
20 industries, including the feed, food, cosmetics and 

pharmaceutical industries. These molecules, which are 
collectively termed "fine chemicals", also include, for example, 
lipids and fatty acids, one representative class of which are the 
polyunsaturated fatty acids. Polyunsaturated fatty acids (PUFAs) 
are added for example to infant formula for increasing the 
nutritional value of these foods. PUFAs have, for example, a 
positive effect on the cholesterol level in the blood of humans 
and are therefore useful for protection against heart disease. 
Fine chemicals such as polyunsaturated fatty acids (PUFAs) can be 
isolated from animal sources such as, for example, fish, or 
produced by microorganisms by culturing microorganisms which have 
been developed such that they produce and accumulate or secrete 
large amounts of one or more desired molecules. 

30 

Fatty acids and triglycerides have a multiplicity of uses in the 
food industry , in animal nutrition, in cosmetics and in the 
pharmacological sector. Depending on whether they take the form 
of free saturated or unsaturated fatty acids or triglycerides 
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with an increased content of saturated or unsaturated fatty 
acids , they are suitable for a variety of uses. Polyunsaturated Q 
3-fatty acids and Q 6-f atty acids constitute an important part of 
animal and human nutrition. Owing to the present-day composition 
5 of human nutrition, an addition of polyunsaturated Q 3 -fatty 

acids, which are predominantly found in fish oils, to the food is 
of particular importance- Thus, for example, polyunsaturated 
fatty acids such as docosahexaenoic acid (=DHA, 
C22:6 A 4, 7, 10, 13,16,19) or eisosapentaenoic acid (= EPA, 
10 C20:5 A5 > 8aU4 » 17 ) is added to baby formula for increasing the 

nutritional value. DHA is said to have a positive effect on brain 
development. 

The various acids and triglycerides are obtained mainly from 
15 microorganisms such as Mortierella or from oil-producing plants 
such as soybeans, oilseed rape, sunflower, algae such as 
Cryptocodinium or Phaeodactylum and others, the products being 
obtained, as a rule, in the form of their triacylglycerides {» 
triglycerides - triglycerols) . However, they can also be obtained 
20 from animals such as, for example/ fish. The free fatty acids are 
advantageously prepared by hydrolysis. Higher polyunsaturated 
fatty acids such as DHA, EPA, arachidonic acid (« ARA, 
C20:4 A5A11 - 14 ) , dihomo-v-linolenic acid (C20 :3 A 8,u,i4) or 
docosapentaenoic acid (DPA, C22:5 A 7,io,i3,i6,i9) cannot be isolated 
25 from oil crops such as oilseed rape, soybeans, sunflower, 

saf flower or others. Conventional natural sources of these fatty 
acids are fish such as herring, salmon, sardine, redfish, eel, 
carp, trout, halibut, mackerel, zander or tuna, or algae. 

30 Depending on the intended purpose, oils with saturated or with 
unsaturated fatty acids are preferred; thus, for example, lipids 
with unsaturated fatty acids, specifically polyunsaturated fatty 
acids, are preferred in human nutrition. The polyunsaturated Q 
3-fatty acids are said to have a positive effect on the 

35 cholesterol level in the blood and thus on the possibility of 
preventing heart disease. The risk of heart disease, stroke or 
hypertension can be reduced markedly by adding these co3-fatty 
acids to the food. Also, Q 3-fatty acids can have a positive 
effect on inflammatory processes, specifically chronically 

40 inflammatory processes in connection with immunological diseases 
such as rheumatoid arthritis. These fatty acids are therefore 
added to foodstuffs, specifically dietetic foodstuffs, or are 
used in medicaments. 

45 In connection with these rheumatic diseases due to the usual 

composition of our foods, Q 6- fatty acids such as arachidonic acid 
tend to have a negative effect on these diseases. 
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Q3- and Q 6-f atty acids are precursors of tissue hormones, what 
are known as eicosanoids such as the postaglandins , which are 
derived from dihomo-Y-li n °l en ^ c acid, arachidonic acid and 
eicosapentaenoic acid, the thrornoxanes and the leukotrienes , 
5 which are derived from arachidonic acid and eicosapentaenoic 

acid. Eicosanoids (known as the PG2 series), which are formed from 
Q 6-f atty acids, promote, as a rule, inflammatory reactions, while 
eicosanoids (known as the PG3 series) from Q 3-f atty acids have a 
minor, or no, proinflammatory action. 

10 

Owing to the positive properties, there has been no lack of 
attempts in the past to make available genes which are involved 
in the synthesis of fatty acids or triglycerides, for the 
production, in various organisms, of oils with a modified content 

15 of unsaturated fatty acids. Thus, WO 91/13972 and its US 

equivalent describe a A9— desaturase . A A15— desatufase is claimed 
in WO 93/11245 and a A12— desaturase is claimed in WO 94/11516- 
Further desaturases are described, for example, in 
EP-A-0 550 162, WO 94/18337, WO 97/30582, WO 97/21340, - 

20 WO 95/18222, EP-A—6 794 250, Stukey et al., J. Biol. Chem. , 265, 
1990: 20144-20149, Wada et al., Nature 347, 1990: 200-203 
or Huang et al ., Lipids 34, 1999: 649—659. However, the 
biochemical characterization of the various desaturases is 
incomplete as yet since the enzymes, being membrane -bound 

25 proteins, can only be isolated and characterized with great 
difficulty (McKeon et al., Methods in Enzymol. 71, 1981: 
12141—12147, Wang et al., Plant Physiol. Biochem., 26, 1988: 
777^792). As a rule, membrane -bound desaturases are characterized 
by introduction into a suitable organism which is subsequently 

30 analyzed for enzyme activity by means of analyses of the starting 
material and the product. A6— Desaturases are described in 
W0 93/06712, US 5,614,393, US5614393, WO 96/21022, WO00/21557 and 
WO 99/27111, and their application for the production in 
transgenic organisms has also been described, such as in 

35 W098/46763 W096/46764, W09846765. In this context, the expression 
of various desaturases is also described and claimed, as is the 
case in W099/64616 or W098/46776, as is the formation of 
polyunsaturated fatty acids. As regards the efficacy of the 
expression of desaturases and their effect on the formation of 

40 polyunsaturated fatty acids, it must be noted that only minor 
contents of A6-unsaturated fatty acids/lipids, such as, for 
example, gamma-linolenic acid and stearidonic acid, have been 
obtained by expression of a single desaturase, as described to 
date. Moreover, a mixture of co 3- and 10 6-f atty acids has been 

45 obtained as a rule, since ail of the A6-desaturases described to 
date converted for example not only linoleic acid (id 6-f atty 
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acid), but also a-linolenic acid (o3-fatty acid). 

Particularly suitable microorganisms for the production of PUFAs 
are microorganisms such as Thraustochytrium species or 
5 Schizochytrium species, algae such as Phaeodactylum tricornutum 
or Crypthecodinium species, ciliates such as Stylonychia or 
Colpidium, fungi such as Mortierella, Entomophthora or Mucor. 
Strain selection has made possible the development of mutant 
strains of the microorganisms in question which produce a series 

10 of desirable compounds, including PUFAs. The mutation 

and selection of strains with an improved production of a 
particular molecule r such as the polyunsaturated fatty acids, is, 
however, a time-consuming and difficult procedure. This is why 
recombinant methods are preferred whenever possible, as described 

15 above. However, only limited amounts of the desired 

polyunsaturated fatty acids such as DPA, EPA or ARA can be 
produced with the aid of the abovementioned microorganisms, these 
unsaturated fatty acids being obtained, as a rule, as fatty acid 
mixtures of, for example, EPA, DPA and DHA, depending on the 

20 microorganism used. 

As an alternative, the production of fine chemicals can suitably 
be carried out on a» large scale vi.a the production in plants 
which have been developed such that they produce the 

25 abovementioned PUFAs. Plants which are particularly suited to 

this purpose are oil crops, which comprise large amounts of lipid 
compounds, such as oilseed rape, canola, linseed, soyabeans, 
sunflowers, borage and evening primrose. However, other crop 
plants which comprise oils or lipids and fatty acids are also 

30 well suited, as mentioned in the extensive description of the 
present invention. Conventional breeding has given rise to a 
series of mutant plants which produce a spectrum of desirable 
lipids and fatty acids , cof actors and enzymes. However,, the 
selection of new plant varieties with improved production of a 

35 particular molecule is a time-consuming and difficult procedure 
or is indeed impossible if the compound does not occur naturally 
in the plant in question, as in the case of polyunsaturated Cia^, 
C2o-fatty acids and C 2 2-fatty acids and those with longer carbon 
chains . 

40 

Owing to the positive properties of unsaturated fatty acids, 
there has been no lack of attempts in the past to make available 
these genes which are involved in the synthesis of fatty acids or 
triglycerides for the production, in various plants, of oils with 
45 a modified content of polyunsaturated fatty acids. However, it 
has been impossible as yet to produce longer-chain 
polyunsaturated C 2 o- and/or C 22 -fatty acids such as EPA or ARA in 
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plants. - 

It was therefore an object to develop a method for the production 
of polyunsaturated fatty acid esters and/or free polyunsaturated 
fatty acids with at least three double bonds in the fatty acid 
molecule. This object was achieved by the method according to the 
invention for the production of compounds of the general formula 
I: 



15 

in transgenic plants with a content of at least 1% by weight 
based on the total fatty acids, which process comprises the 
following steps: _ 

20. aj introducing, into a plant, at least one nucleic acid sequence 
which encodes a polypeptide with a A6-desaturase activity; 

and - - ; "/ ... / . ■ / - ~ ■- 

b) introducing at least one; second nucleic acid sequence; which " 
25 encodes a polypeptide with a A6-elongase activity; and, 

c) if appropriate , introducing a third nucleic acid sequence 
which encodes a polypeptide with a A5-desaturase activity; 

d) followed by growing and harvesting the plants; and 

30 

where the variables and substituents in the formula I have the 
following meanings: 

R 1 = -OH, coenzyme A (thioester), phosphatidylcholine, 
35 phosphatidylethanolamlne, phoshatidylglycerol, 

diphosphatidylglycerol, phosphatidylserine, 
phosphatidylinositol, sphingolipid, glycoshingolipid or a 
radical of the following general formula II 



40 



H 2 C-0-R 2 



H< 
H 



i-O-K' (1I> 



45 
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R 2 * H, phosphatidylcholine-, phosphatidylethanolamine-, 
phosphat idylglycerol-, diphosphatidylglycerol-, 
phosphat idylserine- , phosphatidylinositol- , shingolipid- , 
glycoshingolipid-, glycoshingolipid- or saturated or 
5 unsaturated C2-C24— alkylcarbonyl- , 

R 3 - H, saturated or unsaturated C2— C24— alkylcarbonyl-, or 

r2 and R 3 independently of one another represent a radical of the 
10 general formula la 



15 



CH=CH— CH I" 



H 



-CH-, 



(la), 



n = 3 , 4 or 6 , m «= 3 , 4 or 5 and p 
= 4 or 5 and p » 0 or 3 . 



0 or 3, preferably n = 3, m 



20 



Ri in the compounds of the formula I denotes -OH (hydroxyl-), 
acetyl-coenzyme phosphatidylcholine- , 

phosphatidylethanolamine- , phoshat idylglycerol- , 
diphosphatidylglycerol- , phosphatidylserine-, 
25 phosphat idylinositol-, sphingolipid-, glycoshingolipid- or a 
radical of the following general formula II 



H 2 C— O — R 2 



30 



H' 
H 



0-R 3 



(II) 



The abovementioned radicals for R 1 are in each case bound to the 
35 compounds of the formula I in the form of esters or thioesters. 

R 2 in the compounds of the formula II denotes hydrogen, 
phosphatidylcholine-, phosphatidylethanolamine-, 
phoshatidylglycerol-, diphosphatidylglycerol-, 
40 phosphatidylserine-, phosphat idylinositol-, shingolipid-, 

glycoshingolipid-, glycoshingolipid- or saturated or unsaturated 
C2— C24— alkylcarbonyl- • 

Unsaturated or saturated C2-C22-alkylcarbonyl which may be 
45 mentioned are radicals such as ethylcarbonyl, n-propylcarbonyl, 
n-butylcarbonyl, n-pentylcarbonyl, n-hexylcarbonyl , 
n-heptylcarbonyl , n-octylcarbonyl r n-nonylcarbonyl , 
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n-decy lcarbonyl , n-undecylcarbony 1 , n—dodecy lcarbonyl , 
n—tridecy lcarbonyl , n~tetradecy lcarbonyl , n— pentadecylcarbony 1 , 
n—hexadecy lcarbonyl, n— heptadecy lcarbonyl, n— octadecylcarbonyl , 
n—nonadecy lcarbonyl , n— eicosylcarbonyl , n-docosanylcarbonyl or 
5 n-tetracosanylcarbonyl, all of which may comprise one or more 
double bonds . Preferred are saturated or unsaturated 
Cio-C22-alkylcarbonyl radicals such as n-decylcarbonyl, 
n-undecy lcarbonyl , n— dodecylcarbonyl , n—tridecy lcarbonyl , 
n— tetr adecy lcarbonyl , n— pentadecylcarbonyl f n— hexadecylcarbonyl , 

10 n— heptadecylcarbonyl, n— octadecy lcarbonyl, n— nonadecylcarbonyl , 
n— eicosylcarbonyl, n-docosany lcarbonyl or n-tetracosany lcarbonyl, 
all of which comprise one or more double bonds. Especially 
preferred are saturated or unsaturated Cio— c 22— a *lkyl car b° n Yl 
radicals such as Ci<j— alkylcarbonyl, On— alkylcarbonyl, 

15 C i2— alkylcarbonyl, Ci 3 — alkylcarbonyl, C i4 — alkylcarbonyl, 
C i6— alkylcarbonyl, Cis— alkylcarbonyl, C20— alkylcarbonyl, 
C22— alkylcarbonyl or C24— alkylcarbonyl radicals, all of which 
comprise one or more double bonds. Very especially preferred are 
saturated or unsaturated Ci6— C22— alky lcarbonyl radicals such as 

20 Ci 6 -alky lcarbonyl, Cis— alkylcarbonyl , C20— alkylcarbonyl or 

C 2 2— alkylcarbonyl radicals, all of which comprise one or more 
double bonds. Preferably, the abovementioned radicals comprise 
two, three, four or. five double bonds. Especially preferably, the 
radicals comprise three, four or five double bonds. Very " 

25 especially preferred are Cxe— alkylcarbonyl radicals which comprise 
one, two, three or four, double bonds and C20— alkylcarbonyl 
radicals which comprise three, four or five double bonds. All of 
the abovementioned radicals are derived from the corresponding 
fatty acids. 

30 

R 3 denotes hydrogen or saturated or unsaturated 
C 2 — C24— al ky lcarbonyl . 



R 2 and R 3 in the compounds of the formula II independently of one 
another furthermore denote a radical of the general formula la 



35 



CH; 



:CH 




CH 3 



(la), 



45 where n-3, 4 or 6, m = 3, 4 or 5 and p = 0 or 3, preferably n = 
3 , m - 4 or 5 and p « 0 or 3 . 
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The abovementioned radicals R 1 , R 2 and R 3 may also have attached 
to them substituents such as hydroxyl or epoxy groups or else 
comprise triple bonds . 

5 The nucleic acid sequences used in the method according to the 
invention are isolated nucleic acid sequences which encode 
polypeptides with AS—, A6— desaturase or A6— elongase activity. 

The compounds of the formula I which are produced in this method 
10 advantageously comprise a mixture of differing radicals R 1 , R 2 or 
R 3 which can be derived from differing glycerides. Moreover, the 
abovementioned radicals can be derived from different fatty acids 
such as short-chain fatty acids having 4 to 6 carbon atoms, 
medium-chain fatty acids having 8 to 12 carbon atoms or 
15 long-chain fatty acids having 14 to 24 carbon atoms? the 
long-chain fatty acids are preferred. 

The method according to the invention advantageously gives fatty 
acid esters (= compounds of the formula I) with polyunsaturated 

2Q Ci 8 — , C 2 o- and/or C22-fatty acid molecules with at least two double 
bonds in the fatty acid ester. Preferably, these fatty acid 
molecules comprise three, four or five double bonds and 
advantageously lead to the synthesis of y-linolenic acid (= GLA, 
CIS^* 6 - 9 ' 1 *) , stearidonic acid (» SDA, C18 ; 4* 6 < 9 * «, is> » 

25 dihomo-Y-linolenic acid (= DGLA, 20 : 3 A8 ' 11 * 14 ) r eicosatetraenoic 
acid (= ETA, C20:4 A5 ' 8 ' 11 ' 1 *) , arachidonic acid (ARA) , 
eicosapentaenoic acid (EPA) or their mixtures, preferably EPA 
and/ or ARA. 

30 The fatty acid esters with polyunsaturated Cia— , C20- and/or 

C22-f*tty acid molecules can be isolated from the organisms which 
have been used for the production of the fatty acid esters in the 
form of an oil or lipid, for example in the form of compounds 
such as sphingolipids, phosphoglycerides, lipids, glycolipids 

35 such as glycoshingolipid, phospholipids such as 

phosphatidylethanolamine, phosphatidylcholine, phoshatidylserine, 
phosphatidylglycerol, phosphatidylinositol or 
diphosphatidylglycerol , monoacylglycerides , diacylglycer ides , 
triacylglycerides or other fatty acid esters such as the 

40 acetyl-coenzyme A esters which comprise the polyunsaturated fatty 
acids having at least two, preferably three, double bonds. In 
addition to these esters, the polyunsaturated fatty acids are 
also present in the plants as free fatty acids or bound in other 
compounds. As a rule, the different abovementioned compounds 

45 (fatty acid esters and free fatty acids) are present in the plant 
in an approximate distribution of 80 to 90% by weight of 
triglycerides, 2 to 5% by weight of diglycerides, 5 to 10% by 
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weight of monoglycerides, 1 to 5% by weight of free fatty acids, 
2 to 8% by weight of phospholipids, the total of the different 
compounds making 100% by weight. 

5 When the compounds of the general formula I are produced in the 
method according to the invention, they are produced in a content 
of at least 1% by weight, advantageously at least 2% by weight, 
preferably at least 3% by weight, especially preferably at least 
5% by weight, very especially preferably at least 10% by weight 

10 based on the total of the fatty acids in the transgenic plant. 
Since, in the method according to the invention, the starting 
compounds linoleic acid (C18:2) and/or linolenic acid (C18:3) 
undergo several reaction steps, the end products of the method, 
such as, for example, arachidonic acid ( ARA) or eicosapentaenoic 

15 acid (EPA) are not obtained as pure products, but there are 

always minor amounts of the precursors still present in the end 
product. If both linoleic acid and linolenic acid -are present in 
the original plant, the end products such as ARA and EPA are 
present as mixtures. The precursors;, should advantageously not 

20. amount to more than 20% by weight , -preferably not more than 15% 
by weight, especially preferably not more than 10% by weight, 
very especially preferably not more than 5% by weight, based on; 
the amount of the end product in question. Advantageously, the; ~ 
end products which are produced in the method according to the 

25~ invention in a transgenic plant are only ARA or only EPA, either- 
bound or as free acids (see- compounds of the general formula I)V 
If both compounds (ARA + EPA) are produced simultaneously, they 
are advantageously prduced in a ratio of at least 1:2 (EPA:ARA), 
advantageously at least 1:3, preferably 1:4, especially 

30 preferably 1:5. 

Suitable organisms for the production in the method according to 
the invention are, in principle, all plants such as mosses, 
algae, dicots or monocots. it is advantageous to use, in the 

35 method according to the invention, organisms which belong to the 
oil-producing organisms, i.e. which are used for the production 
of oils, such as algae like Crypthecodinium, Phaeodactylum or 
plants, in particular plants, preferably oil crops, which 
comprise large amounts of lipid compounds, such as peanut, 

40 oilseed rape, canola, sunflower, safflower, poppy, mustard, hemp, 
castor-oil plant, olive, sesame, Calendula, Punica, evening 
primrose, verbascum, thistle, wild roses, hazelnut, almond, 
macadamia, avocado, bay, pumpkin /squash, linseed, soybean, 
pistachios, borage, trees (oil palm, coconut or walnut) or field 

45 crops such as maize, wheat, rye, oats, triticale, rice, barley, 
cotton, cassava, pepper, Tagetes, Solanaceae plants such as 
potato, tobacco, eggplant and tomato, Vicia species, pea, alfalfa 



4 
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or bush plants (coffee, cacao, tea), Salix species and perennial 
grasses and fodder crops. Preferred plants according to the 
invention are oil crops such as peanut , oilseed rape, canola, 
sunflower, saf flower, pea, mustard, hemp, castor-oil plants, 
5 olive, Calendula, Punica, evening primrose, pumpkin/squash, 
linseed, soybean, borage, trees (oil palm, coconut). Especially 
preferred are plants which are high in C18:2- and/or C18:3-fatty 
acid, such as sunflower, safflower, tobacco, verbascum, sesame, 
cotton, pumpkin/squash, poppy, evening primrose, walnut, linseed, 
10 hemp, thistle or safflower. Very especially preferred are plants 
such as safflower, sunflower, poppy, evening primrose, walnut, 
linseed or hemp. 

Owing to the enzymatic activity of the nucleic acids used in the 
15 method according to the invention, which encode polypeptides with 
A5— , A6— desaturase~or A6-elongase activity, different compounds 
of the formula X can be produced. Depending on the choice of the 
plant used for the method according to the invention, mixtures of 
the different compounds of the general formula I or individual 
20 compounds, such as EPA or ARA, can" be produced in free or bound 
form. Depending on the fatty acid composition which prevails in 
the original plant (C18:2- or C18:3-fatty acids), this gives 
compounds of the general formula I. which are derived from 
C18:2-fatty acids, such as GLA-, DGLA- or ARA-comprising 

25 compounds of the formula I, or compounds which are derived from 
C18;3-fatty acids, such as SDA-, ETA- or EFA-coniprising compounds 
of the formula X. If linoleic acid (= LA, C18:2 A *'«) is the only 
unsaturated fatty acid present in the plant used for the method, 
only GLA, DGLA and ARA can be formed as products of the method, 

30 all of which can be present as free fatty acids or in bound form. 
If a-linolenic acid {= ALA, C18 : 3* 9 ' 12 ' 15 ) is the only unsaturated 
fatty acid present in the plant used in the method, for example 
such as in linseed, only SDA, ETA and EPA can be formed as 
products of the method, all of which can be present as free fatty 

35 acids or in bound form, as described above. By modifying the 
activity of the enzymes implicated in the synthesis <A5— , 
A6~desaturase and A6-elongase ) , or by introducing only the first 
two genes ( A6-desaturase and A6-elongase) of the synthetic 
cascade, it is possible to produce in a targeted manner only 

40 individual products in the abovementioned plants (see Figure I). 
Due to the activity of the enzymes A6— desaturase and A6— elongase, 
GLA and DGLA, or SDA and ETA, respectively, form, depending on 
the original plant and the unsaturated fatty acid. DGLA or ETA, 
respectively, or mixtures of these are formed preferentially. If 

45 the enzyme AS—desaturase is additionally introduced into the 
plant, ARA or EPA are additionally formed. It is advantageous 
only to synthesize ARA or EPA or their mixtures, depending on the 
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fatty acid which is present in the plant and -which acts as 
starting material for the synthesis. Since biosynthetic cascades 
are involved, the end products in question are not present in 
pure form in the plants . There are always minor amounts of the 
5 precursor compounds present in the end product. These minor 
amounts amount to less than 20% by weight, advantageously less 
than 15% by weight, especially advantageously less than 10% by 
weight, very especially advantageously less than 5, 4, 3, 2 or 1% 
by weight, based on the end product DGLA, ETA or their mixtures, 
10 or ARA, EPA or their mixtures, respectively. 

For the purposes of the method according to the invention, 
transgenic plants are also understood as meaning plant cells, 
plant organs or intact plants which are grown for the production 

15 of compounds of the general formula I. Growing is understood as 
meaning for example culturing of the transgenic plant cells, 
plant tissue or plant organs on a nutrient medium- or the intact 
plant on or in a substrate, for example in hydroponic culture or 
on an arable .soil. . ~ -- 

20 ~- - • - - - - •' 

Nucleic acids which can be used in the method according to the 
invention are, in principle, all those which encode polypeptides 
with A5— A6— desaturase- or A6— elongase activity. These nucleic 
acids are advantageously derived from plants such as algae, such 7 

25 as Isochrysis or Crypthecodinium, diatoms such as Phaeodactylum, • 
mosses such as Physcomitrella, Ceratodon or higher plants such as 
the primulaceae, such as Aleuritia, Calendula stellata, 
Osteospermum spinescens or Osteosperznum hyoseroides, 
microorganisms such as fungi, such as Aspergillus, 

30 Thraustochytrium, Phytophtora, Entomophthora, Mucor or 
Mortierella, yeasts or animals such as nematodes, such as 
Caenorhabditis , insects or humans. The A5— , A6— desaturase or 
A6— elongase genes are advantageously derived from fungi or from 
plants such as algae or mosses, preferably from plants. 

35 

It is advantageous to in the method according to the invention, a 
nucleic acid sequence selected from the group of the in SEQ ID 
NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 
or their derivative or homologs which encode polypeptides which 

40 retain the enzymatic activity. These sequences, individually or 
in combination, are cloned into expression constructs; these 
expression constructs are represented in the sequences SEQ ID NO: 
33-37. These expression constructs make possible an optimal 
synthesis of the compounds of the general formula I produced in 

45 the method according to the invention. 
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in a preferred embodiment, the method furthermore comprises the 
step of obtaining a cell which comprises the nucleic acid 
sequences which are used in the method and which encode a AS- or 
A6-desaturase and a A6-elongase, where a cell is transformed with 
5 the nucleic acid sequence, a gene construct or a vector which 
bring about the expression of the A5-, A6-desaturase or 
A6-elongase nucleic acid, alone or in combination. In a further 
preferred embodiment, the method furthermore comprises the step 
of obtaining the fine chemical from the culture. The cell 
10 generated thus is advantageously a cell of an oil crop such as, 
for example, peanut, oilseed rape, canola, linseed, soybean, 
saf flower, hemp, sunflowers or borage. 

A transgenic plant is understood as meaning, for the purposes of 
15 the invention, that the nucleic acids used in the method are not 
at their natural locus in the genome of an organism; in this 
context, the nucleic acids can be expressed homologously or 
heterologously. However, transgenic also means that, while the 
nucleic acids according to the invention are at their natural 
20 locus in the genome of ah organism, the sequence has been 
modified in comparison with the natural sequence and/or the 
regulatory sequences of the natural sequences have been modified. 
Preferably, transgenic is understood as meaning that the nucleic 
acids according to the invention are not expressed at their 
25 natural locus in the genome, that is to say that homologous or 
preferably heterologous expression of the nucleic acids takes 
place. Preferred transgenic plants are the oil crops. 

Transgenic plants which comprise the compounds of the formula I 

30 which have been synthesized in the method according to the 
invention can be marketed directly without isolation of the 
compounds which have been synthesized. Plants are understood as 
meaning, in the method according to the invention, all plant 
parts, plant organs such as leaf, stem, root, tuber or seeds, or 

35 all of the plant. In this context, the seed comprises all parts 
of the seed such as the seed coats, epidermis cells and seed 
cells, endosperm or -embyro tissue. However, the compounds 
produced in the method according to the invention can also be 
isolated from the plants in the form of their oils, fat, lipids 

40 and/or free fatty acids. Compounds of the formula I which have 
been produced by this method can be harvested by harvesting the 
organisms either from the culture in which they grow or from the 
field. This can be done by pressing or extracting the plant 
parts, preferably the plant seeds. In this context, the oils, 

45 fats, lipids and/or free fatty acids can be obtained by pressing 
by what is known as cold-beating or cold-pressing, without 
supplying heat. The plant parts, specifically the seeds, are 
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beforehand comminuted, steam-treated or toasted in order to 
facilitate their disruption. The seeds pretreated thus can 
subsequently be pressed or else extracted with solvents such as 
warm hexane. The solvent is subsequently removed. In this manner, 
5 more than 96% of the compounds produced in the method can be 
isolated. The resulting products are subsequently processed 
further, i.e. refined. Here, the plant mucilages and turbid 
matter are first. What is known as degumming can be performed 
enzymatically or, for example, chemico-physically by adding acid 

10 such as phosphoric acid. The free fatty acids are subsequently 
removed by treatment with a base, for example sodium hydroxide 
solution. The resulting product is washed thoroughly with water 
to remove the alkali remaining in the product, and dried. To 
remove the coloring matter which still remains in the product, 

15 the products are bleached, for example using bleaching earth or 
active charcoal. At the end, the product is deodorized, for 
example by using steam. 

_ The PUFAs produced by this method are preferentially C 18 - or 
20 C2o-22- f atty acid molecules having "at least two double bonds in 
the fatty acid molecule, preferably three, four, in combination 
with, a: further elpngases and -/a A4^desaturase five or six double ' 
bonds. These . Cia-. or ■: C 2 0r22 ; ""-f att: Y molecules can be isolated 
from the. organism in the form of anvoily lipid or* a free fatty 
25 acid- Suitable organisms are/- for; example, those which have been 
mentioned above. Preferred organisms are transgenic plants." 

In a preferred embodiment, oils, lipids or fatty acids or 
fractions of these which have been produced by the 
30 above-described method are especially preferably oil, lipid or a 
fatty acid composition which comprise PUFAs or which originate 
from transgenic plants. 

A further embodiment according to the invention is the use of the 
35 oil, lipid or the fatty acid composition in foods, feeds, 
cosmetics or pharmaceuticals. 

The terra "oil" or "fat" is understood as meaning a fatty acid 
mixture which comprises unsaturated, saturated, preferably 

40 esterified fatty acid(s). It is preferred that the oil or fat has 
a high content of unsaturated, unconjugated esterified fatty 
acid(s), in particular linoleic acid, y-linolenic acid, 
dihomo-7-linolenic acid, arachidonic acid, ct~linolenic acid, 
stearidonic acid, eicosatetraenoic acid or eicosapentaenoic acid. 

45 The amount of unsaturated esterified fatty acids is preferably 

approximately 30%, with an amount of 50% being more preferred and 
an amount of 60%, 70%, 80% or more being even more preferred. For 
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identification purposes, it is possible, for -example, to 
determine the amount of fatty acid by gas chromatography after 
converting the fatty acids into the methyl esters by means of 
transesterification. The oil or fat can comprise various other 
5 saturated or unsaturated fatty acids, for example calendulxc 
acid, palmitic acid, stearic acid, oleic acid and the Ixke. The 
amount of the various fatty acids in oil or fat can vary in 
particular as a function of the original plant. 

10 The compounds of the formula I which are produced in the method 
Ind which comprise polyunsaturated fatty acids havxng at least 
two double bonds are sphingolipids , phosphoglycerxdes, lxpxds, 
glycolipids, phospholipids, monoacylglycerol, diacylglycerol, 
triacylglycerol or other fatty acid esters. 

" The polyunsaturated fatty acids which are present can be 

liberated from the compounds of the general formula I produced 
thus in the method according to the invention for example vxa 
treatment with alkali, for example aqueous KOH or KaOH, or acxd 

20 hydrolysis, advantageously in the presence of an alco o ucj as 
methanol or ethanol, or via enzymatic cleavage and - vxa, 
for example, phase separation and subsequent acxdxf xcatxon 
for example, H 2 SQ 4 .. However, the fatty acids can also be liberated 
directly without the above -de scribed processxng. 

25 After they have been introduced into plant cells or plants, the 
nucleic acids used in the method can either be located on a 
separate plasmid or integrated into the genome of the host cell, 
in the case of integration into the genome, the integration can 

30 be random or be effected by recombination in such a way that the 
native gene is replaced by the copy being introduced whereby the 
production of the desired compound by the cell xs modulated or 
by using a gene in trans, so that the gene is linked operably 
with a functional expression unit which comprises at least one 

35 sequence which ensures the expression of a gene and at least one 
sequence which ensures the polyadenylation of a functxonally 
transcribed gene. The nucleic acids are advantageously xntroduced 
into the plants via multiexpression cassettes or constructs for 
the multiparallel seed-specific expression of genes. 

40 Mosses and algae are the only known plant systems which produce 
substantial amounts of polyunsaturated f atty acids sue * * s 
arachidonic acid { ARA) and/or eiccsapentaenoxc acxd (EPA) and/or 
docosahexaenoic acid ( DHA) . Mosses comprise PUFAs xn membrane 

45 lipids, while algae, organisms which are related to algae and 
some fungi also accumulate substantial amounts of PUFAs xn the 
triacylglycerol fraction. This is why nucleic acxd molecules 
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which are isolated from such strains which also accumulate PUFAs 
in the triacylglycerol fraction are especially advantageously 
suitable for the method according to the invention and thus for 
the modification of the lipid and PUFA production system in a 
5 host, in particular plants, such as oil crops, for example 

.oilseed rape, canola, linseed, hemp, soybean, sunflowers, borage. 
They can therefore be used advantageously in the method according 
to the invention. 

10 It has been possible to date to demonstrate that a trienoic acid 
with Cis carbon chain can be produced with the aid of desaturases. 
These methods which are known from the literature claim the 
production of Y-!i no l e nic acid. However, nobody has as yet been 
able to demonstrate the production very long-chain 

15 polyunsaturated fatty acids (with C 2 o- and longer carbon chain and 
of trienoic acids and higher unsaturated types) by modified 
plants alone. 

To. produce the longer-chain PUFAs according to the -invention, -the 

20 polyunsaturated Cig-fatty acids must first be desaturated by the 
enzymatic activity of a desaturase and subsequently elongated by 
at least two carbon atoms via an elongase « After one elongation 
cycle, this . enzyme activity gives. p2&- fatty acids, and after two - 
or "three elongation cycles C22- or C24-fatty acids ^ The activity 

25 of the desaturases and elongases Used method according to the 
invention, gives by preference Cis-/ C20- anc */° r c 22-f atty acids 
having at least two double bonds in the fatty acid molecule, by 
preference three, four or five double bonds, especially 
preferably Cis- and/or C2o-^atty acids with at least two double 

30 bonds in the fatty acid molecule, prefereably with three, four or 
five double bonds in the molecule. After a first desaturation and 
the elongation have taken place, further desaturation steps such 
as, for example, in A5-position, may take place. Especially 
preferred products of the process according to the invention are 

35 arachidonic acid and eicosapentaenoic acid. The Ci3-fatty acids 
with at least two double bonds in the fatty acid can be elongated 
by the enzymatic activity according to the invention in the form 
of the free fatty acid or in the form of the esters, such as 
phospholipids, glycolipids, sphingolipids , phosphoglycerides , 

40 monoacylglycerol, diacylglycerol or triacylglycerol. 

Using cloning vectors in plants and in the transformation of 
plants like those which are published and cited in: Plant 
Molecular Biology and Biotechnology (CRC Press, Boca Raton, 
45 Florida), Chapter 6/7, pp, 71-119 (1993); F.F. White, Vectors for 
Gene Transfer in Higher Plants; in: Transgenic Plants, Vol* 1, 
Engineering and Utilization/ Eds.: Kung and R. Wu, Academic 
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Press, 1993, 15-38; B. Jenes et al . , Techniques for Gene 
Transfer, in: Transgenic Plants, Vol. 1, Engineering and 
Utilization, Eds.: Kung and R. Wu, Academic Press (1993), 
128-143; Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 
5 42 (1991), 2 05-225)), the nucleic acids can be used for the 
recombinant modification of a broad spectrum of plants so that 
this plant becomes a better or more efficient producer of one or 
more lipid-derived products, such as PUFAs. This improved 
production or production efficiency of a lipid-derived product, 
10 such as PUFAs, can be brought about by a direct action of the 
manipulation or an indirect action of this manipulation. 

A series of mechanisms exist by means of which the modification 
of a desaturase protein according to the invention can have a 

15 direct effect on the yield, production and/or production 
efficiency of a fine chemical from an oil crop plant or a 
microorganism, owing to a modified protein. The number or 
activity of the desaturase protein or desaturase gene and of gene 
combinations of desaturases and elongases can be increased, so 

20 that larger amounts of these compounds are produced de novo since 
the organisms lacked this activity and ability to biosynthesize 
them prior to introduction of the gene in question. This also 
applies analogously to the combination with further desaturases 
or elongases or further enzymes of the lipid metabolism. The use 

25 of various divergent sequences, i.e. sequences which differ at 
the DNA sequence level, may also be advantageous, or else the use 
of promoters for gene expression which makes possible a different 
temporal gene expression, for example as a function of the degree 
of maturity of the seed or oil-storing tissue. 

30 

The introduction of a desaturase and/or elongase gene, or several 
desaturase and elongase genes, into an organism, alone or in 
combination with other genes into a cell can not only increase 
the biosynthesis flux toward the end product, but also increase, 

35 or generate de novo, the corresponding triacylglycerol 

composition. Likewise r the number or activity of other genes 
which participate in the import of nutrients required for the 
biosynthesis of one or more fine chemicals (for example fatty 
acids, polar and neutral lipids) can be increased, so that the 

40 concentration of these precursors, cof actors or intermediates 
within the cells or within the storage compartment is increased, 
thus further increasing the ability of the cells to produce PUFAs 
as described hereinbelow. Fatty acids and lipids themselves are 
desirable as fine chemicals; by optimizing the activity or 

45 increasing the number of one or more desaturases and/or elongases 
which participate in the biosynthesis of these compounds, or by 
destroying the activity of one or more desaturases which 
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participate in the breakdown of these compounds, it can be 
possible to increase the yield, production and/or efficiency of 
the production of fatty acid and lipid molecules from plants. 

5 The isolated nucleic acid molecules used in the process according 
to the invention encode proteins or parts of these , the proteins, 
or the individual protein or parts thereof, comprising an amino 
acid sequence with sufficient homology with an amino acid 
sequence of the sequence SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 

10 18, 20, 22, 24, 26, 28, 30 or 32 so that the protein or the part 
thereof retains a desaturase or elongase activity. Preferably, 
the protein or the part thereof which is encoded by the nucleic 
acid molecule has its essential enzymatic activity and the 
capability of being implicated in the metabolism of compounds 

15 which are required for the synthesis of plant cell membranes or 
in the transport of molecules across these membranes. 
Advantageously, the protein encoded by the nucleic acid molecules 
is at least approximately 50%, preferably at least approximately 
60% and more preferably at -least approximately 70%, 80% or 90% 

20 and most preferably at least approximately 95%, 96%, 97%, 98%, 
99% or more homologous to an amino acid sequence of the sequence 
SEQ ID NO: 2,4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, ^3 0 
- or 32* Preferably,- -the protein is a full-length protein which is 
essentially homologous in parts to a total amino acid sequence of 

25 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 
-or 32 (which is the result of the open reading frame shown in 
SEQ ID NO; 1, 3 r 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 
or 31). For the purposes of the invention, homology and 
homologous are understood as meaning identity or identical. 

30 

The term essential enzymatic activity of the desaturases and the 
elongase used is understood as meaning that, in comparison with 
the proteins /enzymes encoded by the sequences with SEQ .ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31, they 

35 retain at least an enzymatic activity of at least 10%, preferably 
20%, especially preferably 30% and very especially 40% and can 
thus be implicated in the metabolism of compounds which are 
required for the synthesis of fatty acids in a plant cell or in 
the transport of molecules across membranes, meaning desaturated 

40 Cis- or C20-22- carbon chains with double bonds at at least two, 
advantageoulsy three, four or five positions. 

Nucleic acids which can advantageously be used in the process 
originate from fungi or plants such as algae or mosses of the 
45 genera Physcomitrella, Thraustochytrium r Phytophtora, Ceratodon, 
Isochrysis, Aleurita, Muscarioides , Mortierella, Borago, 
Phaeodactylum, Crypt hecodinium or from nematodes such as 
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Ceanorhabditis, specifically from the genera and species 
Physcomitrella patens, Phytophtora infestans, Ceratodon 
purpureus, Isochrysis galbana, Aleurita farinosa, Muscarioides 
viallii, Mortierella alpina, Borago officinalis, Phaeodactylum 
5 tricormutum or Ceanorhabditis elegans . 

As an alternative, the isolated nucleotide sequences used can 
encode desaturases or elongases which hybridize, for example 
under stringent conditions, with a nucleotide sequence of the SEQ 
10 ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 
31. 

The nucleic acid sequences used in the process are advantageously 
introduced in an expression cassette which makes possible the 
15 expression of the nucleic acids in plants. 

Advantageous expression cassettes are shown in SEQ ID NO: 33 to 
37. Here, the nucleic acid sequences encoding the desaturases 
and/or the elongases are linked operably with one or more 

20 regulatory signals, advantageously for enhancing gene expression. 
These regulatory sequences are intended to make possible the 
specific expression of genes and of protein expression. Depending 
on the host organism, this may mean, for example, that the gene 
is expressed and/or overexpressed only after induction, or else 

25 that it is immediately expressed and/or overexpressed. For 

example, these regulatory sequences take the form of sequences to 
which inductors or repressors bind and thus regulate expression 
of the nucleic acid. In addition to these novel regulatory 
sequences, or instead of these sequences, the natural regulation 

30 of these sequences before the actual structural genes may still 
be present and, if appropriate, may have been genetically 
modified so that the natural regulation has been switched off and 
the expression of the genes enhanced. However, the expression 
cassette (* expression construct = gene construct) can also be 

35 simpler in construction, that is to say no additional regulatory 
signals have been inserted before the nucleic acid sequence or 
its derivatives, and the natural promoter together with its 
regulation has not been removed. Instead, the natural regulatory 
sequence has been mutated in such a way that regulation no longer 

40 takes place and/or gene expression is enhanced. These modified 
promoters can also be placed before the natural gene alone in the 
form of part-sequences promoter together with parts of the 
nucleic acid sequences according to the invention) to enhance the 
activity. Moreover, the gene construct can advantageously also 

45 comprise one or more enhancer sequences in operable linkage with 
the promoter, which make possible an enhanced expression of the 
nucleic acid sequence. Also, additional advantageous sequences, 
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such as further regulatory elements or terminators, may be 
inserted at the 3' terminus of the DNA sequences. 
The A5— desaturase/A6— - desaturase and/or A6— elongase genes may be 
present in the expression cassette. {- gene construct) in one or 
5 more copies* Advantageously, in each case only one copy of the 
genes is present in the expression cassette- This gene construct, 
or the gene. constructs , can be expressed together in the host 
organism. In this context, the gene constructs ) can be inserted 
in one or more vectors and be present in the cell in free form or 
10 else be inserted in the genome. It is advantageous for the 

insertion of further genes in the host genome when the genes to 
be expressed are present together in one gene construct. 

In this context, the regulatory sequences or factors can, as 
15 described above, preferably have a positive effect on the gene 
expression of the genes which have been introduced, thus 
enhancing it. Thus, the regulatory elements can advantageously be 
enhanced at transcriptional level by using strong transcription 
signals such, as promoters and/or enhancers . In addition, however, 
20 ah enhancement of .-translation is also possible, for example by . 
improving the stability of the mRNA . . - " ; 

A further -.embodiment of the invention are one or more gene \ 
constructs which comprise one or more sequences which are defined 

25 by SEQ ID NO: 1/ 3, 5, 7, 9, 11, 13, 15, 17, 19, -2 1," 23, 25, 27, 
29 or 31 and which encode polypeptides of SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32. The 
abovementioned desaturases introduce a double bond into the A5 or 
A6 position, the substrate having one, two, three or four double 

30 bonds, Elongase ( A6-elongase) has an enzyme activity which 
elongates a fatty acid by at least two carbon atoms. The same 
applies to its homologs, derivatives or analogs which are linked 
operably with one or more regulatory signals, advantageously for 
enhancing gene expression. 

35 

Advantageous regulatory sequences for the novel process are 
present, for example, in promoters such as cos, tac, trp, tet, 
trp-tet, lpp r lac, lpp-lac, lad*' T7, T5, T3, gal, trc, ara, SP6, 
X-P R or X-P L promoter and are advantageously used in Gram-negative 

40 bacteria. Further advantageous regulatory sequences are present, 
for example, in the Gram-positive promoters amy and SP02, in the 
yeast or fungal promoters ADC1, MFa, AC, P-60, CYC1, GAPDH, TEF, 
rp28, ADH or in the plant promoters CaMV/35S [Franck et al., Cell 
21 (1980) 285— 294], PRP1 [Ward et al., Plant. Mol. Biol. 22 

45 (1993)], SSU, OCS, lib4 , usp r STLS1, B33, nos or in the ubiquitin 
or phaseolin promoter. Also advantageous in this connection are 
inducible promoters such as the promoters described in EP— A— 0 
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388 186 (benzylsulfonamide-inducible), Plant -J 2 "9^7-404 
(Gatz et al., tetracyclin-inducible) , EP-A-0 335 528 (abscisxc 
acid-inducible) or WO 93/21334 (ethanol- or cyclohexenol- 
inducible). Further useful plant promoters are the potato 
5 cytosolic FBPase promoter or ST-LSI promoter (Stockhaus et al. , 
EMBO J. 8, 1989, 2445), the Glycine max phosphorxbosyl- 
pyrophosphate amidotransf erase promoter (Genbank Access xon No. 
U87999) or the node-specific promoter described in 
249 676. Especially advantageous promoters are promoters 
10 make possible expression in tissues which are implicated xn fatty 
"xa biosynthesis. Very especially advantageous are seed-specxfxc 
promoters, such as the USP promoter in acco ;; danc * "^J**^ 
specification, but also other promoters such as the LeB4 , DC3, 
Phaseolin or napin promoter. Further especially 
15 promoters are seed-specific promoters whxch can be used for 
Lnocots or dicots "and which are described in US 5,608,152 
ToTlseea rape napin promoter), W0 98/45461 (Arabidopsis oieosxn 
promoter), US 5,504,200 (Phaseolus vulgaris phaseolxn promoter), 
WO 91/13980 (Brassxca Bce4 promoter) described by Baeumlexn et 
20 al Plant J;< 2, 2, 1992 233-239 (LeB4 promoter from a legume), 
saxd promoters being useful in dicots. The following promoters 
are suitable for example in monocots: barley lpt-2 or lpt-1 
promoter (WO 95/15389 and WO 95/23230) barley 

and other suitable promoters which are described xn WO 99/16890 . 



25 



in principle, it is possible to use all natural promoters wxth 
their regulatory sequences like those mentioned above for the 
novel process. It is likewise possible and advantageous to use 
synthetic promoters, in addition or alone, especially when they 
30 confer seed-specific expression, such as, for example, described 
in WO 99/16890. 

in order to achieve a particularly high POFA content in 
transgenic plants, the PUFA biosynthetic genes should 
35 advantageously be expressed in oil crops in a seed-specxfxc 
manner. To this end, seed-specific promoters can be used, or 
those promoters which are active in the embryo and/or xn the 
endosperm. In principle, seed-specific promoters can be "°^ d 
from both dicots and monocots . Advantageous preferred promoters 
40 are detailed hereinbelow: USP ( = unknown seed protexn) and 
vicilin (Vicia faba) [Baumlein et al., Mol. Gen Genet ., 1991 , 
225(3)1, napin (oilseed rape) (US 5,608,152], Acyl-Carrxer 
Protein (oilseed rape) [US 5,315,001 and WO 92/18634], oleosin 
(Labidopsis thalxana) [WO 98/45461 and WO 93/20216], phaseolxn 
45 Phaseolus vulgaris, [US 5,504,200], Bce4 [WO 91/13980 legume 
B4 (LegB4 promoter) [Baumlein et al., Plant J., 2, 2, 1992], Lpt2 
and lptl (barley) [WO 95/15389 and WO95/23230 ] , seed-specxfxc 
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promoters from rice, maize and wheat [WO 99/16890], Amy32b, Amy 
6-6 and aleurain [US 5,677,474 J , Bce4 (oilseed rape) 
(US 5,530,149], glycinin (soya) [EP 571 741], phosphoenolpyruvate 
carboxylase (soya) [JP 06/62870], ADR12-2 (soya) [WO 98/08962], 
5 isocitrate lyase (oilseed rape) [US 5,689,040] or 
P-amylase (barley) [EP 781 849]. 

Plant gene expression can also be facilitated via a chemically 
inducible promoter (see a review in Gat 2 199 7, Annu. Rev. Plant 

10 Physiol. Plant Mol. Biol., 48:89-108). Chemically inducible 

promoters are particularly suitable when it is desired that gene 
expression should take place in a time-specific manner. Examples 
of such promoters are a salicylic-acid-inducible promoter (WO 
95/19443), a tetracyclin-inducible promoter (Gatz et al . (1992) 

15 Plant J. 2, 397-404) and an ethanol-inducible promoter. 

To ensure the stable integration of the biosynthesis genes into 
the transgenic plant over a plurality of generations , each of the 
nucleic acids which encode A6-desaturase, A5-desaturase, or 

20 _ A6-elongase and which are used in the process should be expressed 
under the control of a separate promoter, preferably a promoter 
which differs from the other promoters, since repeating sequence 
motifs can-_lead to. in stability- of the T-DNA, or to * recombination 
events.. In this context, the expression cassette is ~ 

25 advantageously constructed in such a way that a promoter is - 
followed by a suitable cleavage site, advantageously in a 
poly linker, for insertion of the nucleic acid to be expressed 
and, if appropriate, a terminator sequence is positioned behind 
the polylinker. This sequence is repeated several times, 

30 preferably three, four or five times, so that up to five genes 
can be combined in one construct and introduced into the 
transgenic plant in order to be expressed. Advantageously, the 
sequence is repeated up to three times ( see sequence listing SEQ 
ID NO: 33 to 37). To express the nucleic acid sequences, the 

35 latter are inserted after the promoter via a suitable cleavage 
site, for example in the polylinker. Advantageously, each nucleic 
acid sequence has its own promoter and, if appropriate, its own 
terminator sequence. However, it is also possible to insert a 
plurality of nucleic acid sequences after a promoter and, if 

40 appropriate, before a terminator sequence. Here, the insertion 
site, or the sequence, of the inserted nucleic acids in the 
expression cassette is not of critical importance, that is to say 
a nucleic acid sequence can be inserted at the first or last 
position in the cassette without its expression being 

45 substantially influenced thereby. Advantageously, different 

promoters such as, for example, the USP, IiegB4 or DC 3 promoter, 
and different terminator sequences can be used in the expression 
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cassette. However, it is also possible to use only one type of 
promoter in the cassette, which, however, may lead to undesired 
recombination events - 

5 as described above f the transcription of the genes which have 
been introduced should advantageously be terminated by suitable 
terminator sequences at the 3' end of the biosynthesic genes 
which have been introduced (after the stop codon) . An example of 
a sequence which can be used in this context is the 0CS1 
10 terminator sequence. As is the case with the promoters, different 
terminator sequences should be used for each gene. 

As described above, the gene construct can also comprise further 
genes to be introduced into the organisms* It is possible and 

15 advantageous to introduce into the host organisms, and to 

express, regulatory genes such as genes for inductors, repressors 
or enzymes which, owing to their enzyme activity, engage in the 
regulation of one or more genes of a biosynthesis pathway. These 
genes can be of heterologous or of homologous origin. Moreover, 

20 further biosynthesis genes of the "fatty acid or lipid metabolism 
can advantageously be present in the nucleic acid construct, or 
gene construct; however, these genes can also be present on one 
or more further nucleic acid constructs. A biosynthesic gene of 
the fatty acid or lipid metabolism which is preferably chosen is 

25 a gene selected from the group acyl-CoA dehydrogenase ( s ) , 
acyl-ACP acyl carrier protein] desaturase(s) , acyl- ACP 
thioesterase(s) , fatty acid acyltransf erase ( s ) , fatty acid 
synthase(s), fatty acid hydroxylase (s) , acetyl-coenzyme A 
carboxylase (s) , acyl— coenzyme A oxidase (s), fatty acid 

30 desaturase(s ) , fatty acid acetylenases, lipoxygenases, 

triacylglycerol lipases, allene oxide synthases, hydroperoxide 
lyases or fatty acid elongase(s) or their combinations. 

In this context, the abovementioned desaturases can be cloned 
35 into expression cassette according to the invention in 

combination with elongases and other desaturases and employed for 
the transformation of plants with the aid of Agrobacterium. 

In this context, the regulatory sequences or factors can, as 
40 described above, have a positive effect on, preferably, the gene 
expression of the genes introduced, thus enhancing it. Thus, 
enhancement of the regulatory elements can advantageously take 
place at the transcriptional level by using strong transcription 
signals such as promoters and/or enhancers. In addition, however, 
45 enhancement of translation is also possible, for example by 

improving the stability of the mRNA. In principle, the expression 
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cassettes can be used directly for introduction into the plant, 
or else be introduced into a vectors. 

These advantageous vectors, preferably expression vectors, 
5 comprise the nucleic acid which are used in the method and which 
encode A5- or A6-desatures or A6-elonagases , or a nucleic acid 
construct, which the nucleic acid used, alone or in combination 
with further biosynthetic genes of the fatty acid or lipid 
metabolism. As used in the present context, the term "vector" 

10 refers to a nucleic acid molecule which is capable of 

transporting another nucleic acid, to which it is bound. One type 
of vector is a "plasmid", which represents a circular double- 
stranded DNA loop into which additional DNA segments can be 
ligated. Another type of vector is a viral vector, it being 

15 possible for additional DNA segments to be ligated in the viral 
genome. Certain vectors are capable. of autonomous replication 
in a host cell in which they have been introduced- (for example 
bacterial vectors with bacterial origin of replication) . Other 
vectors are advantageously integrated in the genome of a host 

20 cell ..when being introduced into,_the host cell, whereby they 
replicate together- with the host genome. Moreover, certain 
vectors are capable of governing the expression of genes with 
which .they are .operably linked. - These vectors are referred- to 
herein, as. "expression- vectors" * -Usually, .expression vectors which 

25 are suitable for. _DNA; recombination techniques take the form of 
plasmids. In the .present description, "plasmid" and "vector" can 
be used interchangeably since the plasmid is the most frequently 
used vector form. However the invention is also intended to 
comprise these other forms of expression vectors, such as viral 

30 vectors, which have similar functions. Furthermore, the term 

vector is also intended to comprise other vectors which are known 
to the skilled worker, such as phages, viruses such as SV40, CMV, 
TMV, transposons, IS elements, phasmids, phagemids, cosmids, 
linear or circular DNA. 

35 

The recombinant expression vectors which are advantageously used 
in the method comprise the nucleic acids described hereinbelow or 
the above-described gene construct in a form suitable for 
expressing these nucleic acids in a host cell, which means that 

40 the recombinant expression vectors comprise one or more 

regulatory sequences selected on the basis of the host cells to 
be used for the expression, which is linked operably with the 
nucleic acid sequence to be expressed. "Linked operably" in a 
recombinant expression vector means that the nucleotide sequence 

45 of interest is bound to the regulatory sequence(s) in such a way 
that the expression of the nucleotide sequence is possible and 
that they are bound with one another so that both sequences 
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fulfill the predicted function ascribed to the sequence (for 
example in an in-vitro transcription/translation system or in a 
host cell if the vector is introduced into the host cell). The 
term "regulatory sequence" is intended to comprise promoters, 
5 enhancers and other expression control elements (for example 
polyadenylation signals). These regulatory sequences are 
described for example in Goeddel: Gene Expression Technology; 
Methods in Enzymology 185, Academic Press, San Diego, CA (1990), 
or see: Gruber and Crosby, in: Methods in Plant Molecular Biology 

10 and Biotechnolgy, CRC Press, Boca Raton, Florida, eds . : Glick and 
Thompson, chapter 7, 89-108, including the references therein. 
Regulatory sequences comprise those which govern the constitutive 
expression of a nucleotide sequence in many types of host cell 
and those which govern the direct expression of the nucleotide 

15 sequence only in specific host cells under specific conditions. 
The skilled worker-knows that the design of the expression vector 
can depend on factors such as the choice of the host cell to be 
transformed, the expression level of the desired protein and the 
like. 

2Q 

The recombinant expression vectors used can be designed for 
expressing desaturases and elongases in prokaryotic or eukaryotic 
cells. This is advantageous since intermediate steps of vector 
construction are frequently performed in microorganisms for the 

25 sake of simplicity. For example, desaturase and/or elongase genes 
can be expressed in bacterial cells, insect cells (using 
baculovirus expression vectors), yeast cells and other fungal 
cells (see Romanos, M.A., et al. (1992) "Foreign gene expression 
in yeast: a review-, Yeast 8:423-488; van den Hondel, C.A.M.J.J., 

30 et al. (1991) -Heterologous gene expression in filamentous 

fungi-, in: More Gene Manipulations in Fungi, J.w. Bennet & l.l. 
Insure, ed-, pp. 396-428: Academic Press: San Diego; and van den 
Hondel, C.A.K.J.J., & Punt, P.J. (1991) -Gene transfer systems 
and vector development for filamentous fungi, in: Applied 

35 Molecular Genetics of Fungi, Peberdy, J.F. r et al., ed., pp. 
1-28, Cambridge University Press: Cambridge), Algen (Falciatore 
et al., 1999, Marine Biotechnology. 1, 3:239-251), ciliates of 
the types: Holotrichia, Peritrichia, Spirotrichia, Suctoria, 
Tetrahymena, Paramecium, Colpidium, Glaucoma, Platyophrya, 

40 Potomacus, Desaturaseudocohnilembus , Euplotes, Engelmaniella and 
Stylonychia, in particular the genus Stylonychia lemnae, using 
vectors by a transformation method as described in WO 98/01572, 
and preferably in cells of multi-celled plants (see Schmidt, R. 
and willmitzer, L. (1988) "High efficiency Agrobacterlum 

45 tumefacl ens-mediated transformation of Arabidopsis thallana leaf 
and cotyledon explants" Plant Cell Rep .: 583-586 ; Plant Molecular 
Biology and Biotechnology, c Press, Boca Raton, Florida, chapter 
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6/7, pp. 71-119 (1993); F . F . White, B. Jenes et al., Techniques 
for Gene Transfer, in: Transgenic Plants, vol. 1, Engineering and 
Utilization, ed.: Kung and R. Wu, Academic Press (1993), 128-43; 
Potrykus, Annu . Rev. Plant Physiol .. Plant Molec. Biol. 42 (1991), 
5 205-225 (and references cited therein)). Suitable host cells are 
furthermore discussed in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). 
As an alternative, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 
10 promoter regulation sequences and T7 polymerase. 

Protein expression in prokaryotes is usually performed with the 
aid of vectors which comprise constitutive or inducible promoters 
which govern the expression of fusion proteins or nonfusion 

15 proteins. Typical fusion expression vectors are, inter alia pGEX 
(Pharmacia Biotech" Inc; Smith, D.B., and Johnson, K.S. (1988) 
Gene 67:31-40), pKAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) , where glutathione S-transf erase 
(GST), maltose-E-binding protein or protein A, respectively, is 

20 fused with the recombinant target "protein. 

Examples, of suitable inducible nonfusion E. coli expression ~ " 
> vectors -are, inter .alia, pTrc (Amann et al. (1988) Gene y 

• 69:301-315") and pET 1 Id (Studier et -al . , Gene Expression - ' - ' - 
25 Technology: Methods in Ehzymblogy" 1 65 , Academic Press, San Diego," 
California (1990) 60-89). The target gene expression of the pTrc r 
vector is based on the transcription of host RNA polymerase by a 
hybrid trp-lac fusion promoter. The target gene expression from 
the pET lid vector is based on the transcription of a T7-gnl0-lac 
30 fusion promoter, which is mediated by a coexpressed viral RNA 
polymerase (T7 gnl). This viral polymerase is provided by the 
host strains BL21 (DE3) or HMS174 (DE3) by a resident X prophage 
which harbors a T7 gnl gene under the transcriptional control of 
the lacuv 5 promoter. 

35 

Other vectors which are suitable for use in prokaryotic organisms 
are known to the skilled worker; these vectors are, for example 
in E. coli, pLG338, pACYC184, the pBR series such as pBR322, the 
pUC series such as pUC18 or pUC19, the Mll3mp series, pKC30, 
40 pRep4, pHSl, pHS2 r pPLc236, pMBL24, pLG200, pUR290, pIN-III^-Bl , 
Xgtll or pBdCI, in Streptomyces pIJlOl, pIJ364, pIJ702 or pIJ361, 
in Bacillus pUBllO, pC194 oder pBD214, in Corynebacterium pSA77 
or pAJ667. 

45 In a further embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in the 
yeast S. cerevisiae comprise pYeDesaturasecl (Baldari et al. 
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(1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz (1982) 
Cell 30:933-943) , pJRY88 (Schultz et al . (1987) Gene 54:113-123) 
and pYES2 (invitrogen Corporation, San Diego, CA) . Vectors and 
methods for construction of vectors which are suitable for use in 
5 other fungi, such as the filamentous fungi, comprise those which 
are described in detail in: van den Hondel, C. A. M.J. J., & Punt, 
P.J. (19 91) "Gene transfer systems and vector development for 
filamentous fungi, in: Applied Molecular Genetics of fungi, J.F. 
Peberdy et al- , ed., pp. 1-28, Cambridge University Press: 
10 Cambridge, or in: More Gene Manipulations in Fungi [J.W. Bennet fit 
L.L. Lasure, ed., pp. 3 96-428: Academic Press: San Diego]. 
Further suitable yeast vectors are, for example, pAG-1, YEp6, 
YEpl3 or pEMBLYe23. 

15 As an alternative, the desaturases and/or elongases can be 

expressed in insect cells using baculovirus expression vectors. 
Baculovirus vectors which are available for expressing proteins 
in cultured insect cells (for example Sf9 cells) comprise the pAc 
series (Smith et al. (1983) Mol. Cell Biol.. 3:2156-2165) and the 

20 pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

The abovementioned vectors offer only a small overview over 
suitable vectors which are possible. Further plasmids are known 
to the skilled worker and are described, for example, in: Cloning 

25 Vectors (ed. Pouwels, P.H., et al., Elsevier, Amsterdam- 
New York-Oxford, 1985, ISBN 0 444 904018). Further suitable 
expression systems for prokaryotic and eukaryotic cells, see in 
the chapters 16 and 17 of Sambrook, J., Fritsch, E.F., and 
Maniatis, T. , Molecular Cloning: A Laboratory Manual, 2nd 

30 edition, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989. 

In a further embodiment of the process, the desaturases and/or 
elongases can be expressed in single-cell plant cells (such as 

35 algae), see Falciatore et al., 1999, Marine Biotechnology 1 

(3):239-251 and references cited therein, and plant cells from 
higher plants (for example spermatophytes such as crops). 
Examples of plant expression vectors comprise those which are 
described in detail in: Becker, D., Kemper, E . , Schell, J./ and 

40 Masterson, R. (1992) "New plant binary vectors with selectable 
markers located proximal to the left border", Plant Mol. Biol. 
20:1195-1197; and Bevan, M.W. (1984) "Binary Agrobacterium 
vectors for plant transformation", Nucl. Acids Res. 12:8711-8721; 
Vectors for Gene Transfer in Higher Plants; in: Transgenic 

45 Plants, vol. 1, Engineering and Utilization, ed.: Kung and R. Wu, 
Academic Press, 1993, pp. 15-38. 
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A plant expression cassette preferably comprises regulatory 
sequences which are capable of governing the gene expression in 
plant cells and which are linked operably so that each sequence 
can fulfill its function, such as transcriptional termination, 
5 for example polyadenylation signals. Preferred polyadenylat-ion 
signals are those which originate from Agrobacterium tumefaciens 
T-DNA, such as the gene 3 of the Ti plasmid pTiACHS, which is 
known as octopine synthase (Gielen et al., EMBO J. 3 (1984) 
835f£.) or functional equivalents thereof, but all other 
10 terminators which are functionally active in plants are also 
suitable. 

Since plant gene expression is very often not limited to the 
transcriptional levels, a plant expression cassette preferably 
15 comprises other operably linked sequences such as translation 

enhancers, for example the overdrive sequence which comprises the 
5 '-untranslated leader sequence from tobacco mosaic virus, which 
increases the protein/RNA ratio (Gallie et al*, 1987, Nucl. Acids 
Research 15:8693-8711) • 

20. 

As described above, plant gene expression must be linked operably 
with a suitable promoter which performs gene expression with the 
correct timing or in a cell- or tissue-specific manner; 
Utili2able promoters are constitutive promoters (Benfey et al"./ 
25 EMBO J. 8 (1989) 2195-2202) such as those which are derived from 
plant viruses, such as 35S CAMV"(Franck et.al.,*Cell 21 (1980) 
285-294), 19S CaMV (see also US 5352605 and WO 84/02913) or plant 
promoters such as the Rubisco small subunit, which is described 
in US 4,962,028, 

30 

Other sequences which are preferred for the use for operable 
linkage in plant gene expression cassettes are targeting 
sequences , which are required for targeting the gene product into 
its relevant cell compartment (for a review see Kermode, Crit. 
35 Rev. Plant Sci. 15, 4 (1996) 285-423 and references cited 

therein), for example into the vacuole, the nucleus, all types of 
plastids, such as amyloplasts, chloroplasts, chromoplasts , the 
extracellular space , the mitochondria, the endoplasmic reticulum, 
oil bodies, peroxisomes and other plant cell compartments, 

40 

Plant gene expression can also be facilitated as described above 
via a chemically inducible promoter (for a review see Gatz 1997, 
Annu. Rev. Plant Physiol. Plant Mol. Biol., 48:89-108). 
Chemically inducible promoters are suitable in particular when it 
45 is desired that gene expression is clock-specific. Examples of 
such promoters are a salicylic acid-indue ible promoter (WO 
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95/19443), a tetracyclin-inducible promoter (-Gatz et al. (1992) 
Plant J. 2, 397-404) and an ethanol-inducible promoter. 

Other promoters which are suitable are promoters which respond to 
5 biotic or abiotic stress conditions, for example the pathogen- 
induced PRP1 gene promoter (Ward et al., Plant. Mol. Biol. 22 
(1993) 361-366), the heat-inducible tomato hsp80 promoter (US 
5,187,267), the chill-inducible potato alpha-amylase promoter (WO 
96/12814) or the wound-inducible pinll promoter (EP-A-0 375 091). 



Preferred promoters are in particular those which bring about the 
expression of genes in tissues and organs in which lipid and oil 
biosynthesis takes place, in seed cells, such as cells of the 
endosperm and of the developing embryo. Suitable promoters are 
15 the oilseed rape napin gene promoter (US 5,608,152), the Vicia 
faba USP promoter TBaeumlein et al., Mol Gen Genet, 1991, 225 
(3):459-67), the Arabidopsis oleosin promoter (WO • 98/45461 ) , the 
Phaseolus vulgaris phaseolin promoter (US 5,504,200), the 
Brassica Bce4 promoter (WO 91/13980) or- the legumin B4 promoter 
20. (LeB4; Baeumlein et al., 1992, Plant Journal, 2 (2):233-9), and 
promoters which bring about the seed-specific expression in 
monocots such as maize, barley, wheat, rye, rice and the like. 
Suitable promoters .which should be taken into consideration are 
the barley lpt2 or lptl gene promoter (WO 95/15389 and WO 
25 95/23230), or those described in WO 99/16890 (promoters from the 
barley hordein gene, the rice glutelin gene, the rice oryzin 
gene, the rice prolamin gene, the wheat gliadin gene, the wheat 
glutelin gene, the maize zein gene, the oat glutelin gene, the 
sorghum kasirin gene, the rye secalin gene). 



30 



In particular, it may be desired to bring about the multiparallel 
expression of the desaturases and/or elongases used in the method 
alone or in combination with other desaturases or elongases. Such 
expression cassettes can be introduced via the simultaneous 
35 transformation of a plurality of individual expression constructs 
or, preferably, by combining a plurality of expression cassettes 
on one construct. Also, it is possible to transform a plurality 
of vectors with in each case a plurality of expression cassettes 
and to transfer them to the host cell. 



40 



Promoters which are likewise especially suitable are those which 
bring about the plastid-specif ic expression since plastids are 
the compartment in which the precursors and some end products of 
lipid biosynthesis are synthetized. Suitable promoters such as 
45 the viral RNA polymerase promoter are described in WO 95/16783 
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and WO 97/06250, and the Arabidopsis clpP promoter, described in 
WO 99/46394. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells 
5 via conventional transformation or transfection techniques. The 
terms "transformation 7 ' and "transfection", conjugation and 
transduction, as used in the present context, are meant to 
comprise a multiplicity of methods known in the art for 
introducing foreign nucleic acid (for example DNA) into a host 
10 cell, including calcium phosphate or calcium chloride 

coprecipitation, DEAE-dextran-mediated transfection, lipofection, 
natural competence, chemically mediated transfer, 

electroporation or particle bombardment. Suitable methods for the 
transformation or transfection of host cells, including plant 

15 ceils, can be found in Sambrook et al. (Molecular Cloning r A 

Laboratory Manual."; 2nd edition., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989) and other laboratory handbooks such as Methods in Molecular 

„ Biology, 1995, vol. 44, Agrobacterium protocols, ed.: Gartland — 

20 and Davey, Humana Press, Totowa, New Jersey. 

V Host cells: which -are suitable In principle for taking up the " 
.nucleic acid according to the invention, the gene product ~ 
according- to the -invention or the vector according to the lv "~ : 

25 invention are all prokaryotic or- eukaryotic organisms. The host 
organisms which aire advantageously used are organisms such as 
bacteria, fungi, yeasts or plant cells, preferably plants or 
parts thereof. Fungi, yeasts or plants are used by preference; 
especially preferably plants, very especially preferably plants 

30 such as oil crops which comprise large amounts' of lipid 

compounds, such as oilseed rape, evening primrose, hemp, thistle, 
peanut, canola, linseed, soya, safflower, sunflower, borage or 
plants such as maize, wheat, rye, oats, triticale, rice, barley, 
cotton, cassava* pepper, Tagetes, Solanaceae plants such as 

35 potato, tobacco, eggplant and tomato, Vicia species, pea, 

alfalfa, bushy plants (coffee, cocoa, tea), Salix species, trees 
(oil palm, coconut) and perennial grasses and fodder crops. 
Especially preferred plants according to the invention are oil 
crops such as soya, peanut, oilseed rape, canola, linseed, hemp, 

40 evening primrose, sunflower, safflower, trees (oil palm, 
coconut) . 

Nucleic acid sequences which are advantageously used in the 
process according to the invention are those which encode 
45 polypeptides with a A6-desaturase activity, A6-elongase activity 
or A5-desaturase activity, selected from the group consisting of: 
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a) a nucleic acid sequence with the sequence shown in SEQ ID 
NO: 1/ SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO; 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID 

5 NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or SEQ ID NO: 31, 

b) nucleic acid sequences which, owing to the degeneracy of the 
genetic code, are obtained by back translation of the amino 
acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID 

10 NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID 

NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID 
NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 28, SEQ ID 
NO: 30 or SEQ ID NO: 32, 

15 c) derivatives of the nucleic acid sequences shown in SEQ ID 
NO: 1, SEQ ID "NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID 
NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID NO: 23, SEQ ID 
NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or SEQ ID NO: 31 which 

20 encode polypeptides with the amino acid sequences shown in 

SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ 
ID NO: 18, SEQ. ID NO: 20, SEQ . ID NO: 22, SEQ ID NO: 24, SEQ 
ID NO: 26, SEQ ID NO: 28, SEQ ID NO: 30 or SEQ ID NO: 32 and 

25 which have at least 50% homology at the amino acid level, 

without the enzymatic activity of the polypeptides being 
substantially reduced. 

The abovementioned nucleic acid according to the invention 
30 originates from organisms such as animals, ciliates, fungi, 
plants such as algae or dinof lagellates which are capable of 
synthesizing PUFAs. 

The term "nucleic acid (molecule)" as used in the present context 
35 also comprises the untranslated sequence located at the 3' and at 
the 5' end of the coding gene region: at least 500, preferably 
200, especially preferably 100 nucleotides of the sequence 
upstream of the 5' terminus of the coding region and at least 
100, preferably 50, especially preferably 20 nucleotides of the 
40 sequence downstream of the 3' end of the coding gene region. An 
"isolated" nucleic acid molecule is separated from other nucleic 
acid molecules which are present in the natural source of the 
nucleic acid. An "isolated" nucleic acid preferably has no 
sequences which naturally flank the nucleic acid in the genomic 
45 DNA of the organism from which the nucleic acid originates (for 
example sequences which are present at the 5' and 3' ends of the 
nucleic acid). In different embodiments, the isolated desaturase 
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or elongase nucleic acid molecule may comprise, for example less 
than approximately 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0-5 kb or 0.1 kb 
of nucleotide sequences which naturally flank the nucleic acid 
molecule in the genomic DNA of the cell from which the nucleic 
5 acid originates. 

The nucleic acid molecules used in the process, for example a 
nucleic acid molecule with a nucleotide sequence of the SEQ ID 
NO: 1 or a part thereof, can be isolated using molecular - 

10 biological standard techniques and the sequence information 
provided herein. Also, for example a homologous sequence or 
homologous, conserved sequence regions at the DNA or amino acid 
level can be identified with the aid of comparative algorithms. 
They can be used as hybridization probe and standard 

15 hybridization techniques (as described, for example, in Sambrook 
et al., Molecular Cloning: A Laboratory Manual. 2nd ed. , Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, 1989) for the isolation of further 
nucleic acid sequences which are useful- in the process . Moreover, 

20 a "nucleic acid molecule comprising a complete sequence of the SEQ 
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 
31 or a part thereof can be isolated by polymerase chain 
reaction, .where oligonucleotide primers , which are used on the - 
basis of : this -sequence or parts thereof (for example, it" is 

25 possible to isolate a nucleic acid molecule comprising the 

complete sequence or a part thereof by means of polymerase chain 
reaction using oligonucleotide primers which have been generated 
on the basis of the same sequence) ; For example, mRNA can be 
isolated from cells (for example by means of the guanidinium 

30 thiocyanate extraction method of Chirgwin et al. (1979) 

Biochemistry 18:5294-529 9) and cDNA can be generated by means of 
reverse transcriptase (for example Moloney MLV Reverse 
Transcriptase, available from Gibco/BRL, Bethesda, MD, or AMV 
Reverse Transcriptase, available from Seikagaku America, Inc., 

35 St .Petersburg, FL). Synthetic oligonucleotide primers for 
amplification by means of polymerase chain reaction can be 
generated based on one of the sequences shown in SEQ ID NO: 1, 3, 
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 and that of 
Figure 5a, or with the aid of the amino acid sequences shown in 

40 SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 
or 32. A nucleic acid according to the invention can be amplified 
in accordance with standard PCR amplification techniques using 
cDNA or, alternatively, genomic DNA as template and suitable 
oligonucleotide primers. The nucleic acid amplified thus can be 

45 cloned into a suitable vector and characterized by means of DNA 
sequence analysis. Oligonucleotides which correspond to a 
desaturase nucleotide sequence can be generated by means of 
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synthetic standard methods, for example using an automatic DNA 
synthesizer. 

Homologs of the desaturase or elongase nucleic acid sequences 
5 used, with sequence SEQ ID NO: 1, 3, 5, 7 , 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29 or 31, means for example allelic variants with 
at least approximately 50 to 60%, preferably at least 
approximately 60 to 70%, more preferably at least approximately 
70 to 80%, 80 to 90% or 90 to 95% and even more preferably at 

10 least approximately 95%, 96%, 97%, 98%, 99% or more homology with 
one of the nucleotide sequences shown in SEQ ID NO: 1, 3, 5, 7, 
9, 11, 13, 15 r 17, 19, 21, 23, 25, 27, 29 or 31 or their 
homologs, derivatives or analogs, or parts of these. Moreover, 
isolated nucleic acid molecules of a nucleotide sequence which 

15 hybridize with one of the nucleotide sequences shown in SEQ ID 
NO: 1, 3, 5, 7, 9,~ll f 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 
or part thereof, hybridize for example under stringent 
conditions. Allelic variants comprise in particular functional 
variants which can be obtained by deletion, insertion or 

20 substitution of nucleotides from/into the sequence shown in 
SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29 or 31, it being intended, however, that the enzyme activity of 
the resulting synthesized proteins, is advantageously retained for 
the insertion of one or more genes. Proteins which retain the 

25 enzymatic activity of the desaturase or elongase, i.e. whose 

activity is essentially not reduced, means proteins with at least 
10%, preferably 20%, especially preferably 30%, very especially 
preferably 4 0% of the original enzyme activity in comparison with 
the protein encoded by SEQ ID NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 

30 20, 22, 24, 26, 28, 30 or 32. 

Homologs Of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29 or 31 mean for example also bacterial, fungal and 
plant homologs, truncated sequences, single-stranded DNA or RNA 
35 of the coding and noncoding DNA sequence. 

Homologs of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29 or 31 also means derivatives such as, for 
example, promoter variants. The promoters upstream of the 

40 abovementioned nucleotide sequences can be modified by one or 
more nucleotide substitutions, insertions ) and/or deletion(s) 
without, however, interfering with the functionality or activity 
of the promoters. Moreover, it is possible to increase the 
activity of the promoters by modifying their sequence or to 

45 replace them completely by more active promoters, including 
promoters from heterologous organisms. 
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The abovementioned nucleic acids and protein molecules with 
desaturase or elongase activity which are involved in the 
metabolism of lipids and fatty acids, PUFA cof actors and enzymes 
or in the transport of lipophilic compounds across membranes are 
5 used in the process according to the invention for the modulation 
of the production of compounds of the general formula I in 
transgenic plants such as maize, wheat, rye, oats, triticale, 
rice, barley, soybean, peanut, cotton, Linum species such as 
linseed or flax, Brassica species such as oilseed rape, canola 

10 and turnip rape, pepper, sunflower, borage, evening primrose and 
Tagetes, Solanaceae plants such as potato, tobacco, eggplant and 
tomato, Vicia species, pea, cassava, alfalfa, bushy plants 
(coffee, cocoa, tea), Salix species, trees (oil palm, coconut) 
and perennial grasses and fodder crops, either directly (for 

15 example when the overexpression or optimization of a fatty acid 
biosynthesis protein has a direct effect on the yield, production 
and/or production efficiency of the fatty acid from modified 
organisms) and/or can have an indirect effect which nevertheless 
leads to an increase in the yield, production and/or production 

20 efficiency of a desired compound or a~ decrease, in undesired 

compounds (for example when the modulation of the metabolism of 
lipids and fatty acids, cof actors and enzymes leads to 
modifications of the yield, production and/or production- 
.efficiency or the composition of the desired compounds within "the 

25 cells, which, in turn, may have an effect on the production 6f 

_ ; one or. more fatty acids) - 

The combination of different precursor molecules and biosynthetic 
enzymes results in the production of different fatty acid 
30 molecules, which has a decisive effect on lipid composition. 
Since polyunsaturated fatty acids (= PUFAs) are not simply 
incorporated into triacylglycerol, but also into membrane lipids. 

Lipid synthesis can be divided into two sections: the synthesis 
35 of fatty acids and their binding to sn-glycerol-3-phosphate, and 
the addition or modification of a polar head group • Conventional 
lipids which are used in membranes comprise phospholipids, 
glycolipids, sphingolipids and phosphoglycerides. Fatty acid 
synthesis starts with the conversion of acetyl-CoA into 
40 malonyl-CoA by the enzyme acetyl-CoA carboxylase or into 
acetyl-ACP by the enzyme acetyl transacylase . After a 
condensation reaction, these two product molecules together form 
acetoacetyl-ACP, which is converted by a series of condensation, 
reduction and dehydratization reactions so that a saturated fatty 
45 acid molecule with the desired chain length is obtained. The 

production of the unsaturated fatty acids from these molecules is 
catalyzed by specific desaturases, either aerobically by means of 
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molecular oxygen or anaerobically (as regards the fatty acid 
synthesis in microorganisms, see F.C. Neidhardt et al. (1996) E. 
coli and Salmonella. ASM Press: Washington, D.C., pp. 612-636 and 
references cited therein; Lengeler et al. (ed.) (1999) Biology of 
5 Procaryotes. Thieme : Stuttgart, New York, and references therein, 
and Magnuson, K. , et al. (1993) Microbiological Reviews 
57:522-542 and the references therein). 

Examples of precursors for PUFA biosynthesis are oleic acid, 

10 linoleic acid and linolenic acid. These C 18 -carbon fatty acids 
must be elongated to C 2 o and C22 "to obtain fatty acids of the 
eicosa and docosa chain type. With the aid of the desaturases 
used in the process, such as A5- and A6-desaturase and 
A6-elongase, it is possible to obtain arachidonic acid and 

15 eicosapentaenoic acid and various other long-chain PUFAs, to 

extract them and to use them for various purposes in applications 
in foodstuffs, feeding stuffs, cosmetics or pharmacology. Using 
the abovementioned enzymes, it is possible to produce preferably 
Cig .+ c 2 o fatty acids with at least two,- three, four or five 

20 double bonds in the fatty acid molecule, preferably C 2 o-fatty 

acids with advantageously three, four or five double bonds in the 
fatty acid molecule. Desaturation can take place before or after 
elongation of the fatty acid in question. This is why the 
products of desaturase activities and the further desaturation 

25 and elongation which are possible give rise to preferred PUFAs 
with a higher degree of desaturation, including a further 
elongation from C20 to C 22 -fatty acids, to give fatty acids such 
as Y-linolenic acid, dihomo-*y-linolenic acid, arachidonic acid, 
stearidonic acid, eicosatetraenoic acid or eicosapentaenoic acid. 

30 Substrates in the process according to the invention are, for 
example, linoleic acid, Y-linolenic acid, a-linolenic acid, 
dihomo-y-linolenic acid, eicosatetraenoic acid or stearidonic 
acid. Preferred substrates are linoleic acid, y-linolenic acid 
and/or a-linolenic acid, dihomo-y-linolenic acid or arachidonic 

35 acid, eicosatetraenoic acid or eicosapentaenoic acid, 

respectively. The Ci 8 - or C 2 o-fatty acids with at least two double 
bonds in the fatty acid are obtained in the process according to 
the invention in the form of the free fatty acid or in the form 
of its esters (see formula I), for example in the form of its 

40 glycerides. 

The term "glyceride" is understood as meaning a glycerol which is 
ester if ied with one, two or three carboxylic acid residues 
(mono-, di- or triglyceride) . "Glyceride" is also understood as 
45 being a mixture of various glycerides. The glyceride, or 

glyceride mixture, may comprise further additions , for example 
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free fatty acids, antioxidants, proteins, carbohydrates, vitamins 
and/or other substances. 

A "glyceride" for the purposes of the process according to the 
5 invention is furthermore understood as meaning glycerol-derived 
derivatives. These include, in addition to the above -described 
fatty acid glycerides, glycerophospholipids and glyceroglyco- 
lipids. Preferred examples which may be mentioned in this context 
are the glycerophospholipids such as lecithin (phosphatidyl- 
10 choline), cardiolipin, phosphatidylglycerol, phosphatidylserine 
and alkylacylglycerophospholipids . 

Furthermore, fatty acids must subsequently be translocated to 
various sites of modification and incorporated into the 
15 triacylglycerol storage lipid. A further important step in lipid 
synthesis is the transfer of fatty acids on the polar head 
groups, for example by the enzyme glycerol fatty acid 
acyltransf erase (see Frentzen, 1998, Lipid, 100 ( 4-5 ): 161-166 ) . 

20. Publications oh plant fatty acid -biosynthesis, desaturation, -the 
lipid metabolism and membrane transport of lipidic compounds, 
beta-oxidation, fatty acid modification and cof actors, ~ 
triacylglycerol storage and triacylglycerol assembly r including 
the references cited -therein, see the following papers: Kinney? 

25 1997, Genetic Engineering, ed.: JK Setlow, 19: 14 9-166; Ohlro£ge 
andBrowse, 1995, Plant Cell 7 -957-970 ; Shanklin and Gaboon, 
1998, Annu. Rev. Plant Physiol. Plant Mol. Biol. 49:611-641; 
Voelker, 1996, Genetic Engeneering, ed.: JK Setlow, 18:111-13; 
Gerhardt, 1992, Prog. Lipid R « 31:397-417; Giihnemann-SchSLfer & 

30 Kindl, 1995, Biochim. Biophys Acta 1256:181-186; Kunau et'al./ 
1995, Prog. Lipid Res. 34:267-342; Stymne et al., 1993, in: 
Biochemistry and Molecular Biology of Membrane and Storage Lipids 
of Plants, ed.: Murata and Somerville, Rockville, American 
Society of Plant Physiologists, 150-158, Murphy & Ross 1998, 

35 Plant Journal. 13(1): 1-16. 

The PUFAs produced in the process comprise a group of molecules 
which higher animals are no longer capable of synthesizing and 
must therefore take up, or which higher animals can no longer 
40 synthesize themselves in sufficient amounts and must thus 

additionally take them up, although they are synthesized readily 
by other organisms such as bacteria; for example, cats are no 
longer capable of synthesizing arachidonic acid. 

45 For the purposes of the invention, the terms "desaturase or 
elongase" or "desaturase or elongase polypeptide" comprises 
proteins which are implicated in the desaturation and elongation 
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of fatty acids , and their homologs, derivatives or analogs. The 
terms desaturase or elongase nucleic acid sequence (s) comprise 
nucleic acid sequences which encode a desaturase or elongase and 
in which a part can be a coding region and likewise corresponding 
5 5'- and 3 ' -untranslated sequence regions. The terms production or 
producitivity are known in the art and comprise the concentration 
of the fermentation product (compound of the formula I) which is 
formed within a specified period of time and a specified 
fermentation volume (for example kg of product per hour per 

10 liter) . The term production efficiency comprises the time span 
required for obtaining a specific amount of product (for example 
the time required by the cell for establishing a certain 
throughput rate of a fine chemical) . The term yield or 
product/carbon yield is known in the art and comprises the 

15 efficiency with which the carbon source is converted into the 
product (i.e. the fine chemical). This is usually expressed as, 
for example, kg of product per kg of carbon source. Increasing 
the yield or production of the compound results in increasing the 
amount of resulting molecules or -the suitable resulting molecules 

20 of this compound in a certain amount of culture over a specified 
period. The terms biosynthesis or biosynthetic pathway are known 
in the art and comprise the synthesis of a compound, preferably 
an organic compound, by a cell starting from intermediates, for 
example in a multi-step process which is strongly regulated. The 

25 terms catabolism or catabolic pathway are known in the art and 
comprise the cleavage of a compound, preferably an organic 
compound, by a cell to give catabolites (in more general germs, 
smaller or less complex molecules), for example in a multi-step 
process which is strongly regulated. The term metabolism is known 

30 in the art and comprises the totality of the biochemical 

reactions which take place in an organism. The metabolism of a 
certain compound (for example the metabolism of a fatty acid) 
thus comprises the totality of the biosynthetic pathways, 
modified pathways and catabolic pathways of this compound in the 

35 cell which relate to this compound. 

In a further embodiment, derivatives of the nucleic acid molecule 
according to the invention encode proteins with at least 50%, 
advantageously approximately 50 to 60%, preferably at least 

40 approximately 60 to 70% and more preferably at least 

approximately 70 to 80%, 80 to 90%, 9 0 to 95% and most preferably 
at least approximately 96%, 97%, 9 8%, 99% or more homology (= 
identity) with a complete amino acid sequence of the SEQ ID 
NO: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32. 

45 The homology of the amino acid sequence can be determined over 
the entire sequence region using the program PileUp (J. Mol. 
Evolution., 25, 351-360, 1987, Higgins et al., CABIOS, 5, 
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1989:151-153) or BESTFIT or GAP (Henikoff, S; and Henikoff r J. G. 
(1992). Amino acid substitution matrices from protein blocks. 
Proc. Natl. Acad. Sci. USA 89: 10915-10919.) 

5 Moreover, the invention comprises nucleic acid molecules which 
differ from one of the nucleotide sequences shown in SEQ ID 
NO: I, 3, 5 or 11 (and parts thereof) as the result of the 
degeneracy of the genetic code and which thus encode the same 
desaturase as the desaturase which is encoded by the nucleotide 
10 sequences shown in SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 
21, 23, 25, 27, 29 or 31. 

In addition to the desaturase nucleotide sequences shown in SEQ 
ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 

15 31, the skilled worker will recognize that DNA sequence 

polymorphisms which result in modifications in the amino acid 
sequences of the desaturases or elongases may exist within a 
population. These genetic polymorphisms in the desaturase or 
elongase gene may exist between individuals within a population- 

20 as the result of natural" variation. These natural variants 

usually bring about a variance of from 1 to 5% in the nucleotide 
sequence of the desaturase or elongase gene. All and sundry of 
these nucleotide -variations and resulting amino acid ~- 
polymorphisms in-the enzyme desaturase or elongase which are the * 

25 result of natural variation and -which do not modify the 

functional activity of desaturases or elongases are also intended 
to fall under the scope of the invention. 

Nucleic acid molecules which are advantageous for the process 

30 according to the invention can be isolated on the basis of their 
homology with the desaturase or elongase nucleic acids disclosed 
herein using the sequences or part thereof as hybridization 
probe, following standard hybridization techniques under 
stringent hybridization conditions. In this context, it is 

35 possible for example to use isolated nucleic acid molecules which 
are at least 15 nucleotides in length and which hybridize under 
stringent conditions with the nucleic acid molecules which 
comprise a nucleotide sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29 or 31. It is also possible to 

40 use nucleic acids with at least 25, 50, 100, 250 or more 

nucleotides. The term "hybridizes under stringent conditions" as 
used in the present context is understood as describing 
hybridization and wash conditions under which nucleotide 
sequences with at least 60% homology with one another usually 

45 remain hybridized with one another. The conditions are preferably 
such that sequences which are at least approximately 65%, more 
preferably at least approximably 70% and even more preferably at 
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least approximately 7 5% or more homologous with one another 
usually remain hybridized with one another. These stringent 
conditions are known to the skilled worker and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N. Y. 
5 (1989), 6.3.1-6.3.6. A preferred nonlimiting example of stringent 
hybridization conditions is hybridization in 6 x sodium 
chloride/sodium citrate (SSC) at approximately 45°C, followed by 
one or more wash steps in 0.2 x SSC, 0.1% SDS at 50 to 65o C . The 
skilled worker knows that these hybridization conditxons differ 
10 depending on the type of the nucleic acid and, for example when 
organic solvents are used, with regard to the temperature and 
concentration of the buffer. For example, under "standard 
hybridization conditions" the temperature differs depending on 
the type of nucleic acid between 42°C and 58°C in aqueous buffer 
15 with a concentration of 0.1 to 5 x SSC (pH 7.2). If organic 

solvent is present" in the abovementioned buffer, for example 50% 
formamide, the temperature under standard conditions is 
approximately 42°C. The hybridization conditions for DNA:DNA 
hybrids preferably are for example 0.1 x SSC and 20<>c to 45°C. 
20 preferably between" 30 °C and 45°C. The hybridization conditions for 
DNA : RNA hybrids preferably are for example 0.1 x SSC and 30°C to 
55°C, preferably between 45°C and 55°C. The abovementioned 
hybridization temperatures are determined for example for a 
nucleic acid with a length of approximately 100 bp (- base pairs) 
25 and a G + C content of 50% in the absence of formamide. The 

skilled worker knows how to identify the hybridization conditions 
required with the aid of textbooks, such as the one mentioned 
above, or the following textbooks: Sambrook et al. f "Molecular 
Cloning", Cold Spring Harbor Laboratory, 1989; Hames and Higgins 
30 (ed.) 1985, "Nucleic Acids Hybridization: A Practical Approach", 
IRLPress at Oxford University Press, Oxford; Brown (ed.) 1991, 
"Essential Molecular Biology: A Practical Approach", IRL Press at 
Oxford University Press, Oxford. 

35 To determine the percentage homology (« identity) of two amino 
acid sequences (for example of the sequences of SEQ ID NO: 2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32) or of two 
nucleic acids (for example one of the sequences of SEQ ID NO: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31), the 

40 sequences are written underneath each other to provide an optimal 
comparison (for example, gaps may be introduced into the sequence 
of a protein or a nucleic acid in order to generate an optimal 
alignment with the other protein or the other nucleic acid) . The 
amino residues of nucleotides at the corresponding amino acid 

45 positions or nucleotide positions are then compared. If a 

position in a sequence is occupied by the same amino acid residue 
or the same nucleotide as the corresponding position in the other 
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sequence, the molecules are homologous at this position (i.e. 
amino acid or nucleic acid "homology" as used in the present 
context corresponds to amino acid or nucleic acid "identity") . 
The percentage homology between the two sequences is a function 
5 of the number of identical positions which the sequences share 
(i.e.. percent homology = number of identical positions/total 
number of positions x 100). The terms homology and identity are 
thus to be regarded as synonymous . 

10 An isolated nucleic acid molecule which encodes a desaturase or 
elongase which is homologous to a protein sequence of SEQ ID NO: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30 or 32 can 
be generated by introducing one or more nucleotide substitutions, 
additions or deletions into a nucleotide sequence of SEQ ID 

15 NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29 or 31 
so that one or more amino acid substitutions, additions or 
deletions are introduced into the protein which is encoded. 
Mutations can be introduced into one of the sequences of SEQ ID 
NO: 1, 3, 5, 7, .9, 11, 13, 15, 17, 19, 2L, 23, 25, 27, 29~or 31 

2Q -by standard -..techniques such as- site-specif ic mutagenesis and - - * 
PCR-mediated mutagenesis. It is preferred to generate . 
conservative amino acid substitutions at one or more of the 
predicted nonessential amino acid residues. In- a "conservative 
conino. acid - substitution'', the amino acid residue is substituted 

25 by an amino acid. residue with a similar side chain. Families of 
amino acid residues with similar side chains have been defined 
in the art. These families comprise amino acids with basic side 
chains (for example lysine, arginine, histidine), acidic side 
chains (for example aspartic acid, glutamic acid), uncharged 

30 polar side chains (for example glycine, asparagine, glutamine 
serine, threonine, tyrosine, cysteine), unpolar side chains (for 
example alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains 
(for example threonine, valine, isoleucine) and aromatic side 

35 chains (for example tyrosine, phenylalanine, tryptophan, 

histidine). A predicted nonessential amino acid residue in a 
desaturase or elongase is thus preferably substituted by another 
amino acid residue from the same family of side chains. As an 
alternative, the mutations can, in a different embodiment, be 

40 introduced randomly over the entire desaturase-encoding sequence 
or part thereof, for example by means of saturation mutagenesis, 
and the resulting mutants can be screened for the desaturase • 
activity described herein in order to identify mutants which 
retain the desaturase or elongase activity. After the mutagenesis 

45 of one of the sequences of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 
17, 19, 21, 23, 25, 27, 29 or 31 the encoded protein can be 
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expressed recombinant ly, and the activity of -the protein can be 
determined for example using the assays deserved herein. 

The invention is illustrated further by the examples which 
5 follow, but which are not to be construed as limiting. The 
conten; of all of the references, patent applications , patents 
and Published patent applications cited in the present patent 
application is herein incorporated by reference. 

10 Examples section 

Example 1: General methods 

a) General cloning methods: 

" cloning methods such as, for example, restr iction ^ages , 
agarose gel electrophoresis, purification of DNA fragments, 
transfer of nucleic acids onto nitrocellulose and nylon 
„ offlh . ane . linking of DNA fragments, transformation of 

20 "ichia coti Ind yeast cells, bacterial cultures and sequence 
analysis of recombinant DNA were carried out as described in 
Sambrook et al. (1989) (Cold Spring Harbor laboratory Press: 
ISBN 0-87969-309-6). or Kaiser, Michaelis and Mitchell (1994) 
-Methods in Yeast Genetics" (Cold Spring Harbor Laboratory Press: 

25 ISBN 0-87969-451-3). 

b) Chemicals 

Unless otherwise stated in the text, the chemicals used were 
30 obtained in analytical-grade quality from Fluka (Neu-Ulm), Merck 
(Darmstadt), Both (Karlsruhe), Serva (Heidelberg) and 
Sigma (Deisenhof en) . Solutions were made with purified, 
™en-free water, hereinbelow referred to as H 2 C, from a Milli-Q 
Water System water purification system (Millipore, Eschborn . 
35 Restriction endonuc leases , DNA-modif ying enzymes and molecular , 
biology kits were obtained from AGS (Heidelberg), Amersham 

(Braunschweig). Biometra (Gottingen), Boehringer (Mannheim), 
Genomed (Bad Oeynhausen) , New England Biolabs 

(Schwalbach/Taunus), Novagen (Madison, Wisconsin, USA), 
40 Perkin-Elmer (Weiterstadt) , Pharmacia (Freiburg,, Qiagen (Hilden) 

and Stratagene (Amsterdam, Netherlands). Unless otherwise 

specified, they were used in accordance with the manufacturer s 

instructions . 

45 
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Example 2: Isolation of total RNA and poly (A) +-RNA from plants 

Total RNA is isolated from plants such as linseed and oilseed 
rape and the like following a method described by Logemann et al. 
5 (1987, Anal. Biochem. 163, 21). The total RNA can be obtained 
from protonemal tissue from moss using the GTC method (Reski 
et al., 1994, Mol . Gen. Genet., 244:352-359). 

Example 3: Transformation of Agrobacterium 

10 

The Agrobacterium-mediated transformation of plants can be 
carried out for example using the Agrobacterium tumefaciens 
strain GV3101- (pMP90-) (Koncz and Schell, Mol. Gen. Genet. 204 
(1986) 383-396) or LBA4404- (Clontech) or C58C1 pGV2260 (Deblaere 
15 et al 1984, Nucl. Acids Res. 13, 4777-4788)). The transformation 
can be carried out by standard transformation techniques (also 
Deblaere et al. 1984). 

.Example 4: Plant transformation 

20 - 

The Agrobacterium-mediated transformation of plants can be 
carried out using standard transformation and regeneration *f."." 
techniques (Gelvin, Stanton B., Schilperoort , Robert A. , Plant 
Molecular Biology Manual, 2nd ed . , Dordrecht: Kluwer Academic 

25 Publ., 1995, in Sect., Ringbuch Zentrale Signatur: BT11-P ISBN 
0-7923-2731-4; Glick, Bernard R. r Thompson, John E. , Methods in : 
Plant Molecular Biology and Biotechnology, Boca Raton: CRC Press, 
1993, 360 S., ISBN 0-8493-5164-2). 

30 Oilseed rape can be transformed by means of cotyledon or 

hypocotyl transformation (Moloney et al.. Plant Cell 8 (1989) 
238-242; De Block et al., Plant Physiol. 91 (1989) 694-701). The 
use of antibiotics for the seletion of agrobacteria and plants 
depends on the Agrobacterium strain and the binary vector used 

35 for the transformation. Normally, oilseed rape is selected using 
kanamycin as selectable plant marker. 

The Agrobacterium-mediated gene transfer into linseed (Linum 
usitatissimum) can be carried out using for example a technique 
40 described by Mlynarova et al. (1994) Plant Cell Report 
13:282-285. 

The transformation of soya can be carried out using for example a 
technique described in EP— A— 0 042 4 047 (Pioneer Hi-Bred 
International) or in EP- A-0 0397 687, US 5,376,543, US 5, 169,770 
45 (University Toledo). 
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The transformation of plants using particle bombardment, 
Methylene glycol mediated DNA uptake or via the silicon 
rarbonaS f ibe/technigue is described for example by Fueling 

and Walbot "The maize handbook" (1993) ISBN 3-540-97826-7, 
5 Springer Verlag New York) . 

Example 5: Plasmids for plant transformation 

Binary vectors such as pBinAH (Hofgen and "^^"^ 
10 Science 66 (1990) 221-230) or pGPTV (Becker et al 1992, Plant 

M Biol 20:1195-1197) can be used for plant transformation. 

The'b!nary vectors can be constructed by ligating the cDNA in 

llnsl or antisense orientation into T-DNA. 5' of the cDNA, a 
T ! Promoter activates cDKA transcription. A polyadenylation 
15 s guenfeTs Tocateo 3' of the cDNA. The binary vectors can bear 

ear. =5= f=Kr! HH= 

20 i^STp^S,^ plasmid P Binl9 or pGPTV 

corresponds to a 1153 base-pair fragment from Beta vulgaris 
25 (Blant Mol Biol, 1999, 39:463-475). In this context, not only 
ilifonvlureas, but also imida 2 olinones such as imazethapyr or 
sulpronylur:!; may be used as antimetabolites for the selection. 

Tissue-specific expression can be achieved using a 
30 tissue-specific promoter. For example, seed-specific expression 
30 can be achieved by cloning the 0C3 or LeB4 or US P promoter or the 
kacBnUn D romoter 5' of the cDHA. However, any other 
feenpecifirr^oter element such as, for example, the napin or 
arcelin promoter (Goossens et al. 1999, Plant Phys 
^20,41 -1095-1103 and Gerhardt et al. 2000, Biochunica et 
" Biophy s ica Acta 1490 ( 1-2) :87-98) may also be used ™V-35S 
promoter or a v-ATPase CI promoter can be used for constitutive 
expression in the intact plants. 

ao t„ ^articular, genes encoding desaturases and elongases can be 
40 in part icular, g ^ fey con5tructing a 

pruraliS of ex^ession cassettes in order to mimic the metabolic 
pathway in plants. 

as within an expression casette, the protein to be expressed can be 
lei a cellular compartment using a signal peptide, for 
example for plastids, mitochondria or the endoplasmic reticulum 
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(Kermode, Crit. Rev. Plant Sci. 15, 4 ( 1996 ) 285-423) . The signal 
peptide is cloned 5' in the reading frame with the cDNA to 
achieve the subcellular localization of the fusion protein. 

5 Examples of multiexpression cassettes are given hereinbelow. 

I.) Promoter-terminator cassettes 

Expression cassettes consist of least two functional units such 
10 as a promoter and a terminator. Further desired gene sequences 
such as targeting sequences, coding regions of genes or parts 
thereof and the like can be inserted between promoter and 
terminator* To construct expression cassettes, promoters and 
terminators (USP promoter: Baeumlein et al., Mol Gen Genet, 1991, 
15 225 (3):459-67); OCS terminator: Gielen et al. EMBO J. 3 (1984) 
835ff .) are isolated with the aid of the polymerase chain 
reaction and tailor-made with flanking sequences of choice on the 
. basis of synthetic oligonucleotides. 

2Q Examples of oligonucleotides which* can be used are the following: 
USP1 upstream: CCGGAATTCGGCGCGCCGAGCTCCTCGAGCAAATTTACACATTGCCA 
USP2 upstream:- CCGGAATTCGGCGCGCCGAGCTCCTGGAGCAAATTTACACATTGCCA 
USP3 upstream: CCGGAATTCGGCGCGGCGAGCTCCTCGAGCAAATTTACACATTGGCA 
USP1 downstream: AAAACTGCAGGCGGCCGCCCACCGCGGTGGGCTGGCTATGAAGAAATT 

30 USP2 downstream: C GC GG ATCCGC TG G C TAT G AAG AAATT 

USP3 downstream: TCCCCCGGGATCGATGCCGGCAGATCTGCTGGCTATGAAGAAATT 
OCS1 upstream: AAAACTGCAGTCTAGAAGGCCTCCTGCTTTAATGAGATAT 

35 

OCS2 upstream: 

CGCGGATCCGATATCGGGCCCGCTAGCGTTAACCCTGCTTTAATGAGATAT 
OCS3 upstream: TCCCCCGGGCCATGGCCTGCTTTAATGAGATAT 

40 

0CS1 downstream: 

CCCAAGCTTGGCGCGCCGAGCTCGAATTCGTCGACGGACAATCAGTAAATTGA 

OCS2 downstream: 
45 CCCAAGCTTGGCGCGCCGAGCTCGAATTCGTCGACGGACAATCAGTAAATTGA 



OCS3 downstream: C C CAAGC TTGGC GCGC C GAGC T C GTC GAC GGAC AATCAGTAAA TT GA 
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The methods are known to the specialist worker and are generally 
known from the literature. 

5 

In a first step, a promoter and a terminator are amplified via 
PCR- Then, the terminator is cloned into a recipient plasmid and, 
in a second step, the promoter is inserted upstream of the 
terminator. This gives an expression cassette on a plasmid 
10 vehicle. The plasmids pUTl, 2 and 3 are generated on the basis of 
the plasmid pUC19. 

The constructs are defined in accordance with the invention in 
SEQ ID NO: 33, 34 and 42. They comprise the USP promoter and the 

15 OCS terminator. Based on these plasmids, the construct pUTl2 is 
generated by cutting pUTl with Sail/Seal and cutting pUT2 with 
Xhol/Scal. The fragments in the expression cassettes are ligated 
and transformed into E. coli XXI blue MRF . After picking out 
ampicillin-resistant colonies, DNA is prepared, and those clones 

20 which comprise two expression cassettes are identified by 

restriction analysis- The Xhol/Sall ligation of compatible ends 
has eliminated the two cleavage sites Xhol and Sail between the 
expression cassettes. This gives rise to plasmid pUT12, which is 
defined in SEQ ID NO: 36. pUT12 is subsequently cut again with 

25 Sail/Seal and pUT3 with Xhol/Scal. The fragments comprising the 
expression cassettes are ligated and transformed into E. coli XLI 
blue MRF. After singling out ampicillin-resistant colonies, DNA 
is prepared, and those clones which comprise three expression 
cassettes are identified by restriction analysis. In this manner, 

30 a set of multiexpression cassettes is created which can be 

exploited for inserting the desired DNA and is described in Table 
1 and can additionally incorporate further expression cassettes. 

They comprise the following elements: 

35 



40 



45 
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Table 1 





pUC19 
derivate 


Cleavage sites before 
the USP promoter 


Multiple 
cloning cleavage sites 


Cleavage sites behind the 
OCS terminator 


e 
o 


pUTl 


EcoRI/AscI/ Sacl/Xhol 


iDSt/U/iNOti/ rSci/A.oai/oiui 


Sall/EcoRI/ SacI/AscI/ 
Hindni 




pUT2 


EcoRI/AscI/ Sacl/Xhol 


BamHI/EcoRV/ Apal/Nhel/ Hpal 


Sall/EcoRI/ Sacl/AscI/ 
Hindni 




pUT3 


EcoRI/AscI/ Sacl/Xhol 


Bglll/Nael/ Clal/Smal/Ncol 


Sail/Sad/ Ascl/Hindlll 


10 


pUT12 
Double 




BstXI/NotI/ Pstl/Xbal/Siul 
and 


Sall/EcoRI/ SacI/AscI/ 
Hindlll 




expression 
cassette 




BamHI/EcoRV/ Apal/Nhel/ Hpal 








l.BstXI/NotI/ Pstl/Xbal/StuI 




15 


pUT123 

Triple 

expression 

cassette 


EcoRI/AscI/ Sacl/Xhol 


and 

2. BamHI/EcoRV/ Apal/Nhel/ 
Hpal 

and 

3. BglIl/NaeI/ Clal/Smal/Ncol 


S all/SacI/Asc l/Hi ndlll 



20 " 

Furthermore, further multiexpression cassettes can be generated ~ 
and employed for seed-specific gene expression, as described and 
as specified in greater detail in Table 2 , with the aid of the. 



25 i) USP promoter or with the aid of "the 

ii) 700 base pair- 3 ' fragment of the LeB4 promoter or with the. 
aid of the 

iii) DC3 promoter - 

30 The DC 3 promoter is described in Thomas, Plant Cell 1996, 

2 63:359-368 and consists merely of the region -117 to +26, which 
is why it therefore constitutes one of the smallest known 
seed-specific promoters. The expression cassettes can comprise 
several copies of the same promoter or else be constructed via 

35 three different promoters. 

The vectors used for the transformation of plants and the 
sequences of the inserted genes/proteins can be found in sequence 
listing SEQ ID NO: 43 to 49. 

40 

Advantageously used polylinker or polylinker-terminator- 
polylinkers can be found in the sequences SEQ ID NO: 50 to 52. 



45 
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Multiple expression cassettes 



Plasmid name ot 
the pUCl9 
derivative 


Cleavage sites before 
the respective 
promoter 


Multiple 
cloning cleavage sites 


Cleavage sites behind 
the OCS terminator 


pUTl 

(pUC19 with 
USP-OCS1) 


EcoRI/AscySacI/XhoI 


(1) BstXI/Notl/Pstl/ Xbal/Stui 


Sall/EcoRI/SacI/Ascl/ 
Hindlll 


( P UC19 with 
DC3-OCS) 


EcoRI/AscI/SacI/Xhol 


(2) BamHVEcoRV/ Apal/NheV 
Hpal 


Sall/EcoRI/Sac l/Asc V 
Hindlll 


pLeBT 

(pUC19-with 
LeB4(700)-OCS) 


EcoRI/AscUSacI/XhoI 


(3) Bglll/NaeU Clal/Smal/Ncol 


Sall/Sacl/Ascl/Hindlll 


pUD12 

(pUC 19 with 
USP-OCS1 and 
with DC3-OCS) 


EcoRI/AscI/SacI/Xhol 


(1) BstfcWlotl/ I>stl/Xbat/Stul 
and 

(2) BamHI/EcoRV/ Apal/Nhel/ 
Hpal 


Sall/EcoRI/SacI/Ascl/ 
Hindlll 


pUDL123 
Triple expression 
cassette 

(pUCl9 with 
USP/DC3 and 
LeB4-700) 


EcoRI/AscI/SacI/Xhol 


(1) BstXI/NotI/ Pstl/Xbal/Srul 
and • 

(2) BarnHI/ (EcoRV*)/ApaI/ 
Nhel/Hpal and 

(3) Bglll/Nael/ Clal/Srnal/Ncol 


Sail/Sacl/Ascl/Hindin 



10 



15 



2Q 



25 



* EcoRV cleavage site in the 700 base-pair fragment of the LeB4 
promoter ( LeB4 -700) 



Further promoters for multi-gene constructs can be generated 
30 analogously, in particular using the 

a) 2.7 kb fragment of the LeB4 promoter or with the aid of the 

b) phaseolin promoter or with the aid of the 

c) constitutive v-ATPase cl promoter. 

35 It may be particularly desirable to use further especially 
suitable promoters for constructing seed-specific 
multi-expression cassettes such as, for example, the napin 
promoter or the arcelin-5 promoter. 

II) Generation of expression constructs which comprise promoter, 
terminator and desired gene sequence for the expression of 
PUFA genes in plant expression cassettes. 

4 „ In pUT123, the A6-elongase Pp_PSEl is first inserted into the 
45 first cassette via BstXI and Xbal. Then, the moss A-6-desaturase 
<Pp_des6) is inserted into the second cassette via BamHl/Nael, 



40 
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and, finally/ the Phaeodactylum A5~desaturase (Pt_des5) is 
inserted into the third cassette via Bglll/Ncol. The triple 
construct is named pARAl. Taking into consideration 
sequence-specific restriction cleavage sites, further expression 
5 cassettes are shown in Table 3, which are named pARA2 , pARA3 and 
pARA4, can be generated. 

Table 3: Combinations of desaturases and elongases 



10 



15 



Gene plasmid 


A6-Desaturase 


A5-Desaturase 


A€-Elongase 


pARAl 


Pp_des€ 


Pt_des5 


Pp_PSEl 


pARA2 


Pt_des6 


Pt_des5 


Pp_PSEl 


pARA3 


Pt_des6 


Ce_des5 


Pp_PSEl 


pARA4 


Ce_des6 


Ce_des5 


Ce_PSEl 



Pp = Physcomitrella patens , Ft « Phaeodactylum tricornutum 
Pp_PSEl corresponds to the sequence of SEQ ID NO: 9. 

20 PSE » PUFA-specific A6-elongase 

Ce_des5 = A5-desaturase from Caenorhabditis elegans (Genbank 
Acc. No. AE078796) - - , 

Ce_des6 = A6-desaturase from Caenorhabditis elegans elegans 
(Genbank Acc. No. AF031477, bases 11-1342) 

2 5 Ce_PSEl « A6-elongase. from Caenorhabditis elegans (Genbank 
Acc, No. AF244356,.. bases. 1-867 ) 

Further desaturases or elongase sequences can also be inserted 
into the expression cassettes in the described manner, such as, 
30 for example, Genbank Acc. Nr. AF231981, NM_013402, AF206662, 
AF268031, AF226273, AF110510 or AF110509. 

iii) Transfer of expression cassettes into vectors for the 

transformation of Agrobacterium tumefaciens and for the 
35 transformation of plants 

The constructs generated thus are inserted into the binary vector 
pGPTV by means of Ascl. For this purpose, the multiple cloning 
sequence is extended by an Ascl cleavage site. For this purpose, 

40 the polylinker is synthesized de novo as two double-stranded 
oligonucleotides, thereby introducing an additional Ascl DNA 
sequence. The oligonucleotide is inserted into the vector pGPTV 
by means of EcoRI and Hindlll. The cloning techniques required 
are known to the skilled worker and can simply be found in the 

45 literature as described in Example 1- 
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Example 6: Studying the expression of a recombinant gene product 
in a transformed organism 

The activity of a recombinant gene product in the transformed 
5 host organism can be measured at the transcriptional and/or the 
translational level, 

A suitable method for determining the extent to which the gene is 
transcribed (which indicates the amount of RNA which is available 

10 for the translation of the gene product) is to carry out. a 

Northern blot as detailed hereinbelow (as reference, see Ausubel 
et al. (1988) Current Protocols in Molecular Biology, Wiley: New 
York, or the abovementioned Examples Section) , where a primer 
which is designed in such a way that it binds to the gene of 

15 interest is labeled with a detectable label (usually a 

radioactive label or a chemiluminescent label) so that, when the 
total RNA of a culture of the organism is extracted, separated on 
a gel, transferred onto a stable matrix and incubated with this 
probe, the binding and extent of the binding of the probe 

20 indicates the existence and also the amount of the mRNA for this 
gene. This information indicates the degree to which the 
transformed gene has been transcribed. Cellular total RNA can be 
prepared from cells, tissues or organs using a plurality of 
methods, all of which are known in the art, such as, for example, 

25 the method of Bormann, E.R., et al. (1992) Mol. Microbiol. 
6:317-326. 

Northern hybridization: 

30 To carry out the RNA hybridization, 20 [ig of total RNA or 1 \ig of 
poly (A)* RNA were separated by gel electrophoresis in agarose gels 
with a strength of 1.25% using formaldehyde , as described in 
Amasino (1986, Anal. Biochem. 152, 304), capillary-blotted onto 
positively charged nylon membranes Hybond N+, Amersham, 

35 Braunschweig) using 10 x SSC, immobilized using UV-light and 

prehybridized for 3 hours at 68°C using hybridization buffer (10% 
dextran sulfate w/v, 1 M NaCl, 1% SDS, 100 mg herring sperm DNA) . 
The DNA probe was labeled with the Highprime DNA labeling kit 
(Roche, Mannheim, Germany) during the prehybridization step, 

40 using alpha- 32 P-dCTP (Amersham, Braunschweig, Germany). After the 
labeled DNA probe had been added, the hybridization was carried 
out overnight at 68°C in the same buffer. The wash steps were 
carried out twice for 15 minutes using 2 x SSC and twice for 
30 minutes using 1 x SSC, 1% SDS, at 68°C. The sealed filters were 

45 exposed at -70°C for a period of 1 to 14 days. 
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Standard techniques , such as a Western blot, -can be employed for 
studying the presence or the relative amount of protein 
translated by this mRNA (see, for exajnple, Ausubel et al. (1988) 
Current Protocols in Molecular Biology, Wiley: New York). In this 
5 method, the cellular total proteins are extracted, separated by 
gel electrophoresis, transferred to a matrix such as 
nitrocellulose and incubated with a probe such as an antibody 
which binds specifically to the desired protein- This probe is 
usually provided with a chemiluminescent or colorimetric label 
10 which can be detected readily. The presence and the amount of the 
label observed indicates the presence and the amount of the 
desired mutated protein present in the cell. 

Example 7: Analysis of the effect of the recombinant proteins on 
15 the production of the desired product 

The effect of the genetic modification in plants,- fungi, algae, 
ciliates or on the production of a desired compound (such as a 
fatty acid) can be determined by -growing the modified 

20 microorganisms or the modified plant under suitable conditions 
(such as those described above) and analyzing the medium and/or 
the cellular components for the increased production of the 
-desired product (i.e. of lipids or a fatty acid). These 

analytical techniques are known to the skilled worker and " 

25 comprise spectroscopy, thin-layer chromatography, various tyes of 
staining methods, enzymatic and microbiological methods, and 
analytical chromatography such as high-performance liquid 
chromatography (see, for example, Ullman, Encyclopedia of 
Industrial Chemistry, vol. A2, pp. 89-90 and pp. 443-613, VCH: 

30 Weinheim (1985); Fallon, A., et al., (1987) "Applications of HPLC 
in Biochemistry" in: Laboratory Techniques in Biochemistry and 
Molecular Biology, vol. 17; Rehm et al. (1993) Biotechnology, 
vol. 3, chapter III; "Product recovery and purification", pp. 
469-714, VCH: Weinheim; Belter, P. A., et al. (1988) 

35 Bioseparations: downstream processing for Biotechnology, John 
Wiley and Sons; Kennedy, J.F. and Cabral, J. M.S. (1992) Recovery 
processes for biological Materials, John Wiley and Sons; 
Shaeiwitz, J. A., and Henry, J.D. (1988) Biochemical Separations, 
in: Ullmann's Encyclopedia of Industrial Chemistry, vol. B3 ; 

40 chapter 11, pp. 1-27, VCH: Weinheim; and Dechow, F.J.. (1989) 
Separation and purification techniques in biotechnology, Noyes 
Publications) - 

In addition to the abovementioned methods, plant lipids are 
45 extracted from plant material as described by Cahoon et al. 

(1999) Proc. Natl. Acad. Sci. USA 96 ( 22 ): 12935-12940 , and Browse 
et al. (1986) Analytic Biochemistry 152:141-145. The qualitative 
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and quantitative lipid and fatty acid analysis is described by 
Christie, William W., Advances in Lipid Methodology, 
^Scotland: Oily Press (Oily Press Lipid Library; 2), Chr»*«. 
W flliam W., Gas Chromatography and Lipids A Pract ica Gu.de 
5 _ Ayr , Scotland: Oily Press, 1989, Repr. 1992, IX, 307 S. (Oily 

Press Lipid Library; 1); "Progress in Lipid Research, Oxford. 

Pergamon Press, 1 (1952) - 16 (1977) u.d.T.: Progress in the 

Chemistry of Fats and Other Lipids CODEN . 

10 To determine the overall efficiency with which the compound is 
produced, it is also possible, in addition to measuring the 
fermentation end product, to analyze other components of the 
metabolic pathways which are used for producing the ^sired 
Tompounds, such as intermediates and secondary products The 

15 analytical methods comprise measurements of the nutrient 
quantities in the medium (for example sugars, hydrocarbons 
nitrogen sources, phosphate and other ions), measurements of the 
biomass composition and the growth, analysis of the production of 
usual metabolites via biosynthetic pathways, and measurements of 

20 gases which are generated during the f ermentation process . 

Standard methods for these measurements are described in Applied 
Microbial Physiology; A Practical Approach, P.M. Rhodes and 
P.P. Stanbury, ed., IRL Press, pp.. 103-129; 131-163 and 165-192 
(ISBN: 0199635773) and references cited therein. 

One example is the analysis of fatty acids ( abbreviations : fames , 
fatty acid methyl esters; GC-MS, gas liquid chromatography /mass 
spectrometry; TAG, triacylglycerol; TLC, thin-layer 
chromatography) . 

unequivocal proof for the presence of fatty acid products can be 
obtained by the analysis of recombinant organisms following 
standard analytical procedures: GC, GC-MS or TLC as variously 
described by Christie and references therein (1997, in: Advances 
35 on Lipid Methodology, Fourth ed.: Christie, Oily Press Dundee, 
119-169; 1998, gas-chromatography/mass-spectrometry methods, 
Lipids 33:343-353). 

Material to be analyzed can be disintegrated via sonif ication, 
40 glass milling, liquid nitrogen and grinding or via °^ er 

applicable methods. The material has to be centnfuged after 
disintegration. The sediment is resuspended in Aqua dest, £*£d 
for 10 min at 100°C, cooled on ice and centrifuged again, followed 
by extraction in 0.5 M sulfuric acid in methanol containing 2% 
45 dlmethoxypropane for 1 h at 90°C, leading to hydrolyzed oil and 
liquid compounds, resulting in transmethylated lipids. Thee 
faJty acid methyl esters are extracted in petrolether and finally 



25 



30 
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subjected to GC analysis using a capillary column (Chrompack, 
WCOT Fused Silica, CP-Wax-52 CB, 25 \im, 0-32 mm) at a temperature 
gradient beteen 170°C and 240°C for 20 min and 5 min at 240°C. The 
identity of resulting fatty acid methylesters has to be defined 
5 by the use of standards available from commercial sources (i.e. 
Sigma) . 

In the case of fatty acids where standards are not available 
molecule identity has to be shown via derivati2ation and 
10 subsequent GC MS analysis. For example the localization of triple 
bond fatty acids has to be shown via GC-MS after derivatization 
via 4 , 4-dimethoxyoxazoline derivatives (Christie, 1998, see 
above ) . 

15 Expression constructs in heterologous microbial systems 
Strains, Growth Conditions and Plasmids 

Escherichia coli strain XLl Blue MRF r kan (Stratagene) was used 
20 for sub-cloning the new elongase pPDesaturasel from ~ 

Physcomitrella patens. For functional expression of this gene we* 
used the Saccharomyces cerevisiae strain INVSc 1 (Invitrogen 
Co.). E. coli was grown in Luria-Bertini. broth (LB, Duchefa, : 
Haarlem, The .Netherlands ) at 37«C When necessary,, ampicillin 
25 (100 mg/liter) was added and 1.5% (w/v) agar. was included for 
solid LB„media. S. cerevisiae was grown at 30°C either in 
YPG-medium or in complete minimal dropout uracil medium (CMdum; 
see in: Ausubel, F.M., Brent, R. , Kingston, R.E., Moore, D.D., 
Seidman, J.G., Smith, J. A., Struhl, K., Albright, L.B. , 
30 Coen,D.M., and Varki, A. (1995) Current Protocols in Molecular 
Biology,. John Wiley & Sons, New York) containing either 2% (w/v) 
raffinose or glucose. For solid media 2% (w/v) Bacto™ agar 
(Difco) was included. Plasmids used for cloning and expression 
were pUC18 (Pharmacia) and pYES2 (Invitrogen Co.). 

35 

Example 8: Cloning and expression of PUFA-specif ic desaturases 
and elongases 

For expression in plants, cONA clones from SEQ ID NO: 1, 3, 5, 7, 
40 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31 were modified in 
such a way that only the coding region was amplified by means of 
polymerase chain reaction using two oligonucleotides. Care was 
taken that a consensus sequence before the start codon was 
maintained for efficient translation. To this end, either the 
45 base sequence ATA or AAA was chosen and introduced into the 

sequence before the ATG (Kozak, M - (1986) Point mutations define 
a sequence flanking the AUG initiator codon that modulates 
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translation by eukaryotic ribosomes, Cell 44, 283-292). In 
addition, a restriction cleavage site was introduced before this 
consensus triplet, which restriction cleavage site must be 
compatible with the cleavage site of the target vector into which 
5 the fragment is to be cloned and with the aid of which the 

expression of genes in microorganisms or plants is to take place. 

The PCR reaction was performed with plasmid DNA as template in a 
Thermocycler (Biometra) using the Pfu-DNA (Stratagene) polymerase 

10 and the following temperature programme: 3 minutes at 96°C, 

followed by 30 cycles with 30 seconds at 96°C, 3 0 seconds at 55°C 
and 2 minutes at 72°C, 1 cycle with 10 minutes at 72°C and stop at 
4°C. The annealing temperature was varied, depending on the 
oligonucleotides chosen. A synthesis time of approximately one 

15 minute can be assumed per kilobase pairs DNA. Further parameters 
which have an effect on the PCR such as, for example, Mg ions, 
salt, DNA polymerase and the like are known to the specialist 
worker and can be varied as required. 

20 The correct size of the amplified ~DNA fragment was verified by 
agarose-TBE gel electrophoresis. The amplified DNA was extracted 
from the gel using the QlAquick Gel Extraction Kit (QIAGEN) and 
ligated into the Smal restriction .site of the dephosphorylated 
vector pUCl8 using the Sure Clone Ligation Kit (Pharmacia), 

25 giving rise to the pUC derivatives. After the transformation of 
E. coli XL1 Blue MRF ' kan, a DNA miniprep (Riggs, M.G., & 
McLachlan, A. (19 86) A simplified screening procedure for large 
numbers of plasmid mini-preparation. BioTechniques 4, 310-313) 
was carried out on ampicillin-resistant transf ormants , and 

30 positive clones were identified by means of BamHI restriction 
analysis. The sequence of the cloned PCR product was verified by 
resequencing using the ABI PRISM Big Dye Terminator Cycle 
Sequencing Ready Reaction Kit (Perkin-Elmer , Weiterstadt) . 

35 Fatty acid analysis 

The total fatty acids were extracted from plant seeds and 
analyzed by gas chromatography. 

The seeds were taken up in 1% sodium methoxide in methanol and 
40 incubated for 20 minutes at RT. Thereafter, the mixture is washed 
with NaCl solution, and the FAMES are taken up in 0.3 ml heptane. 
The samples were separated on a ZEBRON-ZB-Wax capillary column 
(30 m, 0.32 mm, 0-25 yin; Phenomenex) in a Hewlett Packard-6850 gas 
chromatograph with flame ionization detector. The oven 
45 temperature was programmed from 70°C (1 minute hold) to 200°C at a 
rate of 2 0°C /minute, then to 250°C (5 min hold) at a rate of 
5°C/min and finally to 260°C at a rate of 5°C/min. Nitrogen was 
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used as carrier gas (4.5 ml/min at 70°C) . The fatty acids were 
identified by comparing the retention times with those of FAME 
standards (SIGMA). 

5 Expression analysis 

Result of the expression of a Phaeodactylum tricornutum 
A6-acyl-lipid desaturase, a Phaeodactylum tricornutum 
A5-acyl-lipid desaturase and the delta-6-specif ic elongase in 
10 tobacco seeds: 

Figure 2: Fatty acid profile of transgenic tobacco seeds. The 
plants were transformed with a triple expression cassette which 
expresses, under the control of the USP promoter, the delta-6-, 
15 the delta-5- and the Physcomitrella patens PpPSEl ( pAKA2 ) • 100 
transgenic tobacco and linseed plants are generated, of which 
approximately 20% synthesize arachidonic acid in the seed. 

Figure 3 : Tobacco wild-type control. 
2Q * . . J ' . 

Example 9: Purification of the desired product from transformed 
organisms 

The^desired product can be obtained from plant material or fungi, 
25 algae, ciliates, animal cells or- from the supernatant of the 

above-described cultures by various methods known in the art. If - 
the desired product is not excreted from the cells, the cells can 
be harvested from the culture by slow centrif ugation, and the 
"cells can be lyzed by standard techniques such as mechanical 
30 force or sonication. Plant, organs can be separated mechanically 
from other tissue or other organs. After homogenization, the cell 
debris is removed by centrif ugation, and the supernatant 
fraction, which comprises the soluble proteins, is stored for the 
further purification of the desired compound. If the product is 
35 excreted from desired cells, the cells are removed from the 

culture by slow centrif ugation r and the supernatant fraction is 
stored for further purification. 

The supernatant fraction of each purification method is subjected 
40 to chromatography with a suitable resin, the desired molecule 
either being retained on the chromatography resin, while many 
contaminations in the sample are not, or else the contaminations 
are retained on the resin, while the sample is not- If necessary, 
these chromatography steps can be repeated, using identical or 
45 different chromatography resins. The skilled worker is familiar 
with the selection of suitable chromatography resins and their 
most effective application for a particular molecule to be 
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purified- The purified product can be concentrated by filtration 
or ultrafiltration and stored at a temperature which provides 
maximum stability of the product. 

5 A broad spectrum of purification methods is known in the art, and 
the above purification method is not intended to be limiting. 
These purification methods are described, for example, in 
Bailey, J.E., & Ollis, D.F. , Biochemical Engineering 
Fundamentals, McGraw-Hill: New York (1986). 

10 

The identity and purity of the compounds which have been isolated 
can be determined by standard techniques of the art* These 
include high-performance liquid chromatography (HPLC), 
spectroscopic methods, staining methods, thin-layer 

15 chromatography, in particular thin-layer chromatography and flame 
ionization detection (IATROSCAN, latron, Tokio, Japan), NIRS, 
enzyme assay or microbiological methods. For an overview of these 
analytical methods, see: Patek et al. (1994) Appl. Environ. 
Microbiol. 60:133-140; Malakhova -et al- (1996) Biotekhnologiya 

20 11:27-32; and Schmidt et al. (1998) Bioprocess Engineer. 

19:67-7 0. Ulmann's Encyclopedia of Industrial Chemistry (1996) 
vol. A27, VCH: Weinheim, pp. 89-90, pp. 521-540, pp. 540-547, 
pp. 559-566, 575-581 and pp. 581-587; Michal, G (1999) 
Biochemical Pathways: An Atlas of Biochemistry and Molecular 

25 Biology, John Wiley and Sons; Fallon, A., et al. (1987) 

Applications of HPLC in Biochemistry in: Laboratory Techniques in 
Biochemistry and Molecular Biology, vol. 17. 

Equivalents 

30 

The skilled worker will or can recognize many equivalents of the 
specific embodiments according to the invention described herein 
by simply using routine experiments. These equivalents are 
intended to fall within the patent claims. 



40 



45 



0093/00064 



CA 02485060 2004-10-28 



55 



We claim: 



1. A process for the production of compounds of the general 
5 formula I: 



10 



CH, 



(I) 



in transgenic plants with a content of at least 1% by weight 
based on the total fatty acids, which process comprises the 
15 following steps: 

a) introducing, into a plant, at least one nucleic acid 
sequence which encodes a polypeptide with an 
A6-desaturase activity; and 

20 

b) introducing at least one second nucleic acid sequence 
which encodes a polypeptide with a A 6-elongase activity; 
and, 

25 c) if appropriate, introducing a third nucleic acid sequence 

which encodes a polypeptide with a A 5-desaturase 
activity; 



30 



d) followed by growing and harvesting the plants; and 

where the variables and substituents in the formula I have 
the following meanings : 

Ri = -OH, coenzyme A (thioester), phosphatidylcholine, 
35 phosphatidylethanolamine, phoshatidylglycerol , 

diphosphatidylglycerol , phosphatidylserine , 
phosphatidylinositol, sphingolipid, glycoshingolipid or a 
radical of the following general formula II 



40 



H 2 C-0— R z 



45 
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H , phosphatidylcholine- , phosphatidylethanolamine- , 
phosphat idy lglycerol- , diphosphat idylglycerol- , 
phosphatidylser ine- , phosphatidylinositol- , shingolipid- , 
glycoshingolipid-, glycoshingolipid- or saturated or 
unsaturated C2— C24— alkylcarbonyl- , 



R 3 



H 



, saturated or unsaturated C 2 — C24— alkylcarbonyl-, or 



R 2 and R 3 independently of one another represent a radical of 
the general formula la 



15 




ch=:ch 



— CH- 









CH 3 




m 




P 



(la) 



n « 3, 4 or 6, m » 3 f 4 or 5 and p » 0 or 3 • 

20 2. The process according to claim 1, wherein the substituents R 2 
and R 3 independently of one another are 
C 1 o-C22-alkylcarbonyl- . 

3. The process according to claim 1 or 2, wherein the 
25 substituents R 2 and R 3 independently of one another are Cie-, 

Ci8-/ C 2 o- or C 2 2-al)cylcarbonyl-, 

4* The method according to any of claims 1 to 3, wherein the 
substituents R 2 and R 3 independently of one another are 
30 unsaturated Ci 6 -, C 18 -, C 2 q- or C 2 2-&lkylcarbonyl- with one, 

two, three, four or five double bonds - 



35 



5. The method according to any of claims 1 to 4, wherein the 
transgenic plant is an oil crop. 

6. The method according to any of claims 1 to 5, wherein the 
transgenic plant is selected from the group consisting of 
soya, peanut, oilseed rape, canola, linseed, evening 
primrose, verbascum, thistle, hazelnut / almond, macadamia, 

40 avocado, bay, wild roses, pumpkin/ squash, pistachios, sesame, 

sunflower, safflower, borage, maize, poppy, mustard, hemp, 
castor-oil plant, olive, Calendula, Punica, oil palm, walnut 
or coconut. 



45 7. 



The method according to any of claims 1 to 6, wherein the 
compounds of the formula I are obtained from the transgenic 



0093/00064 



CA 02485060 2004-10-28 



57 

plants in the form of their oils, fats, lipids or free fatty 
acids by pressing or extraction • 

8. The process according to any of claims 1 to 7, wherein the 

5 oils, fats, lipids or free fatty acids obtained as claimed in 

claim 7 are refined, 

9. The process according to any of claims 1. to 8, wherein the 
saturated or unsaturated fatty acids present in the compounds 

10 of the formula I are liberated. 

10. The method according to any of claims 1 to 9, wherein the 
saturated or unsaturated fatty acids are liberated by 
alkaline hydrolysis or enzymatic cleavage. 

15 

11. The method according to any of claims 1 to 10, wherein the 
compounds of the general formula I are present in the 
transgenic plant at a content of at least 5% by weight, based 
on the total fatty acids. 

20 • * 

12. The process according to any of claims 1 to 11, wherein the 
nucleic acid sequences which encode the polypeptides with 
A6-desaturase activity, A6-elongase activity or A5-desaturase 
activity are selected from the group consisting "of : 

25 ■- . - ' ~ " '■' 

a) a nucleic acid sequence with the sequence shown in SEQ ID 
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7 , SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, 

SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID 
30 NO: 23, SEQ ID NO: 25, SEQ ID NO: 27, SEQ ID NO: 29 or 

SEQ ID NO: 31, 

b) nucleic acid sequences which, owing to the degeneracy of 
the genetic code, are obtained by back translation of the 

35 amino acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4, 

SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, 
SEQ ID NO: 2B, SEQ ID NO: 30 or SEQ ID NO: 32, 

40 

c) derivatives of the nucleic acid sequences shown in SEQ ID 
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID 
NO: 9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NOr 15, 

SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 21, SEQ ID 
45 NO: 23, SEQ ID NO: 25 f SEQ ID NO: 27, SEQ ID NO: 29 or 

SEQ ID NO: 31 which encode polypeptides with the amino 
acid sequences shown in SEQ ID NO: 2, SEQ ID NO: 4, 
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SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, 
SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 26, 
SEQ ID NO: 28, SEQ ID NO: 30 or SEQ ID NO: 32 and which 
5 have at least 50% homology at the amino acid level, 

without the enzymatic activity of the polypeptide being 
substantially reduced • 

13. The process according to any of claims 1 to 12, wherein the 
10 nucleic acid sequences as claimed in claim 8 are linked with 

one or more regulatory signals in a nucleic acid construct. 

14. The method according to any of claims 1 to 13, wherein the 
nucleic acid construct comprises additional biosynthetic 

15 genes of the fatty acid or lipid metabolism selected from the 

group consisting of acyl-CoA dehydrogenase ( s ) , acyl-ACP 
[= acyl carrier protein] desaturase( s ) , acyl— ACP 
thioesterase(s) , fatty acid acyl transf erase (s) , fatty acid 
synthase(s), fatty acid hydroxylase! s ) , acetyl-coenzyme A 

20 carboxylase^ ) , acyl— coenzyme A oxidase ( s ) , fatty acid 

desaturase(s) , fatty acid acetylenases, lipoxygenases, 
triacylglycerol lipases, allene oxide synthases, 
hydroperoxide lyases or fatty acid elongase(s). 

25 



30 



35 



40 
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SEQUENCE LISTING 



<110> BASF Plant Science GmbH 

<120> Process for the production of polyunsaturated 
fatty acids in plants 

<130> 2002/271 

<140> 2002_271 
<141> 2002-04-26 

<160> 64 

<170> Patentln Vers. 2.0 

<210> 1 _ . - - 

<211> 1687 ' •< 
<212> DNA : 

<213> Borago officinalis 

<220> 
<221> CDS 

<222> (42) ..(1388) """ 
<223> A6-desaturase 

<400> 1 

tatctgccta ccctcccaaa gagagtagtc atttttcatc a atg get get caa ate 56 

Met Ala Ala Gin lie 
1 5 

aag aaa tac att acc tea gat gaa etc aag aac cac gat aaa ccc gga 104 
Lys Lys Tyr lie Thr Ser Asp Glu Leu Lys Asn His Asp Lys Pro Gly 

10 15 20 

gat eta tgg ate teg att caa ggg aaa gec tat gat gtt teg gat tgg 152 
Asp Leu Trp lie Ser lie Gin Gly Lys Ala Tyr Asp Val Ser Asp Trp 
25 30 35 
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gtg aaa gac cat cca ggt ggc age ttt ccc ttg aag agt ctt get ggt 200 

Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu Lys Ser Leu Ala Gly 

40 45 50 

caa gag gta act gat gca ttt gtt gca ttc cat cct gec tct aca tgg 248 

Gin Glu Val Thr Asp Ala Phe Val Ala Phe His Pro Ala Ser Thr Trp 

55 60 65 

aag aat ctt gat aag ttt ttc act ggg tat tat ctt aaa gat tac tct 2 96 

Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr Leu Lys Asp Tyr Ser 

70 75 80 85 

gtt tct gag gtt tct aaa gat tat agg aag ctt gtg ttt gag ttt tct 344 

Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu Val Phe Glu Phe Ser 

90 95 100 

aaa atg ggt ttg tat gac aaa aaa ggt cat att atg ttt gca act ttg 392 

Lys Met Gly Leu Tyr Asp Lys Lys Gly His He Met Phe Ala Thr Leu 

105 110 115 

tgc ttt ata gca atg ctg ttt get atg agt gtt tat ggg gtt ttg ttt 440 

Cys Phe He Ala Met Leu Phe Ala Met Ser Val Tyr Gly Val Leu Phe 

120 125 130 

tgt gag ggt gtt ttg gta cat ttg ttt tct ggg tgt ttg atg ggg ttt 488 

Cys Glu Gly Val Leu Val His Leu Phe Ser Gly Cys Leu Met Gly Phe 

135 140 145 

ctt tgg att cag agt ggt tgg att gga cat gat get ggg cat tat atg 53 6 

Leu Trp He Gin Ser Gly Trp He Gly His Asp Ala Gly His Tyr Met 

150 155 160 165 

gta gtg tct gat tea agg ctt aat aag ttt atg ggt att ttt get gca 584 

Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met Gly He Phe Ala Ala 

170 175 180 



aat tgt ctt tea gga ata agt att ggt tgg tgg aaa tgg aac cat aat 
Asn Cys Leu Ser Gly He Ser He Gly Trp Trp Lys Trp Asn His Asn 
185 190 195 



632 
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gca cat cac att gcc tgt aat age ctt gaa tat gac cct gat tta caa 6 80 

Ala His His lie Ala Cys Asn Ser Leu Glu Tyr Asp Pro Asp Leu Gin 
200 205 210 

tat ata cca ttc ctt gtt gtg tct tec aag ttt ttt ggt tea etc acc 7 28 

Tyr lie Pro Phe Leu Val Val Ser Ser Lys Phe Phe Gly Ser Leu Thr 
215 220 225 

tct cat ttc tat gag aaa agg ttg act ttt gac tct tta tea aga ttc 776 

Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp Ser Leu Ser Arg Phe 

230 235 240 245 

ttt gta agt tat caa cat tgg aca ttt tac cct att atg tgt get get 824 

Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro lie Met Cys Ala Ala 
250 2S5 260 

agg etc aat atg tat gta caa tct etc ata atg ttg ttg acc aag aga "872 

Arg Leu Asn Met Tyr Val Gin Ser Leu lie Met" Leu Leu Thr Lys Arg 
265 270 275 

aat gtg tec tat cga get cat gaa etc ttg gga tgc eta gtg ttc teg 920 

Asn Val Ser Tyr Arg Ala His Glu Leu Leu Gly Cys Leu Val Phe Ser 
280 285 290 

att tgg tac ccg ttg ctt gtt tct tgt ttg cct aat tgg ggt gaa aga 9 68 

lie Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro Asn Trp Gly Glu Arg 
295 300 305 

att atg ttt gtt att gca agt ttg tea gtg act gga atg caa caa gtt 1016 

lie Met Phe Val lie Ala Ser Leu Ser Val Thr Gly Met Gin Gin Val 

310 315 320 325 

cag ttc tec ttg aac cac ttc tct tea agt gtt tat gtt gga aag cct 1064 

Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val Tyr Val Gly Lys Pro 
330 335 340 

aaa ggg aat aat tgg ttt gag aaa caa acg gat ggg aca ctt gac att 1112 

Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp Gly Thr Leu Asp lie 
345 350 355 
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tct tgt cct cct tgg atg gat tgg ttt cat ggt gga ttg caa ttc caa 1160 
Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly Gly Leu Gin Phe Gin 
360 365 370 

att gag cat cat ttg ttt ccc aag atg cct aga tgc aac ctt agg aaa 1208 
He Glu His His Leu Phe Pro Lys Met Pro Arg Cys Asn Leu Arg Lys 
375 380 3a5 



ate teg ccc tac gtg ate gag tta tgc aag aaa cat aat ttg cct tac 1256 
He Ser Pro Tyr Val He Glu Leu Cys Lys Lys His Asn Leu Pro Tyr 
390 395 400 4 05 

aat tat gca tct ttc tec aag gee aat gaa atg aca etc aga aca ttg 1304 
Asn Tyr Ala ser Phe Ser Lys Ala Asn Glu Met Thr Leu Arg Thr Leu 
410 415 420 

agg aac aca gca ttg cag get agg gat ata acc aag ccg etc ccg aag 1352 
Arg Asn Thr Ala Leu Gin Ala Arg Asp lie Thr Lys Pro Leu Pro Lys 
425 430 435 



aat ttg gta tgg gaa get ctt cac act cat ggt taa aattaccctt 
Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
440 445 



1398 



agttcatgta ataatttgag attatgtatc tcctatgttt gtgtcttgtc ttggttctac 1458 
ttgttggagt cattgeaact tgtcttttat ggtttattag atgtttttta atatatttta 1518 
gaggttttgc tttcatctcc attattgatg aataaggagt tgcatattgt caattgttgt 1578 
gctcaatatc tgatattttg gaatgtactt tgtaccactg tgttttcagt tgaagctcat 1638 
gtgtacttct atagactttg tttaaatggt tatgaaaaaa aaaaaaaaa i 6 87 



<210> 2 
<211> 448 
<212> PRT 

<213> Borago officinalis 
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<400> 2 

Met Ala Ala Gin lie Lys Lys Tyr lie Thr Ser Asp Glu Leu Lys Asn 
1 5 10 15 

His Asp Lys Pro Gly Asp Leu Trp lie Ser lie Gin Gly Lys Ala Tyr 
20 25 30 

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu 
35 40 45 

Lys Ser Leu Ala Gly Gin Glu Val Thr Asp Ala Phe Val Ala Phe His 
50 55 60 

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr 
65 70 75 80 

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu 

85 ; " ' 90 ' 95- 

Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His lie 

100 -- ~ • 105 *: ■ 110 

Met Phe Ala Thr Leu Cys Phe lie Ala Met Leu Phe Ala Met Ser Val 
115 120 . 125 

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly 
130 135 140 

Cys Leu Met Gly Phe Leu Trp lie Gin Ser Gly Trp lie Gly His Asp 
145 150 155 160 

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met 
165 170 175 

Gly lie Phe Ala Ala Asn Cys Leu Ser Gly lie Ser lie Gly Trp Trp 
180 185 190 



Lys Trp Asn His Asn Ala His His He Ala Cys Asn Ser Leu Glu Tyr 
195 200 205 
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Asp Pro Asp Leu Gin Tyr lie Pro Phe Leu Val Val Ser Ser Lys Phe 
210 215 220 

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp 
225 230 235 240 

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gin His Trp Thr Phe Tyr Pro 
245 250 255 

lie Met Cys Ala Ala Arg Leu Asa Met Tyr Val Gin Ser Leu lie Met 
260 265 270 

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala His Glu Leu Leu Gly 
275 280 285 

Cys Leu Val Phe Ser lie Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro 
290 295 300 

Asn Trp Gly Glu Arg lie Met Phe Val lie Ala Ser Leu Ser Val Thr 
305 310 315 320 

Gly Met Gin Gin Val Gin Phe Ser Leu Asn His Phe Ser Ser Ser Val 
325 330 335 

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gin Thr Asp 
340 345 350 

Gly Thr Leu Asp lie Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly 
355 360 365 

Gly Leu Gin Phe Gin He Glu His His Leu Phe Pro Lys Met Pro Arg 
370 375 380 

Cys Asn Leu Arg Lys He Ser Pro Tyr Val lie Glu Leu Cys Lys Lys 
385 390 395 400 

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met 
405 410 415 



Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gin Ala Arg Asp He Thr 
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420 425 430 

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly 
435 440 445 



<210> 3 
<211> 1192 
<212> DNA 

<213> Physcomitrella patens 

<220> 

<221> CDS 

<222> (58), .(930) 

<223> A6-elongase 

<400> 3 - - " ■ 

ctgcttcgtc tcatcttggg ggtgtgattc gggagtgggt tgagttggtg. gagcgca 57 

atg .gag gtc gtg gag aga ttc tac ggt gag- ttg gat. ggg aag gtc -teg "105 
Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
1 5 10 15 

cag ggc gtg aat gca ttg ctg ggt agt ttt ggg gtg gag ttg acg gat 153 
Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Vai Glu Leu Thr Asp 
20 25 30 

acg ccc act acc aaa ggc ttg ccc etc gtt gac agt ccc aca ccc ate 201 
Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro He 
35 40 45 

gtc etc ggt gtt tct gta tac ttg act att gtc att gga ggg ctt ttg 249 
Val Leu Gly Val Ser Val Tyr Leu Thr He Val He Gly Gly Leu Leu 
50 55 60 

tgg ata aag gec agg gat ctg aaa ccg cgc gec teg gag cca ttt ttg 297 
Trp He Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 



etc caa get ttg gtg ctt gtg cac aac ctg ttc tgt ttt gcg etc agt 345 
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Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 

85 90 95 

ctg tat atg tgc gtg ggc ate get tat cag get att acc tgg egg tac 3 93 

Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala lie Thr Trp Arg Tyr 
100 105 110 

tct etc tgg ggc aat gca tac aat cct aaa cat aaa gag atg gcg att 441 

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala lie 
115 120 125 

ctg gta tac ttg ttc tac atg tct aag tac gtg gaa ttc atg gat acc 489 

Leu Val Tyr Leu Phe Tyr Wet Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

gtt ate atg ata ctg aag cgc age acc agg caa ata age ttc etc cac 537 

Val He Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 

145 150 155 160 

gtt tat cat cat tct tea att tec etc att tgg tgg get att get cat 585 

Val Tyr His His Ser Ser He Ser Leu He Trp Trp Ala lie Ala His 

165 170 175 

cac get cct ggc ggt gaa gca tat tgg tct gcg get ctg aac tea gga 633 

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 

gtg cat gtt etc atg tat gcg tat tac ttc ttg get gee tgc ctt cga 681 

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

agt age cca aag tta aaa aat aag tac ctt ttt tgg ggc agg tac ttg 729 

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 

aca caa ttc caa atg ttc cag ttt atg ctg aac tta gtg cag get tac 777 

Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 

225 230 235 240 



tac gac atg aaa acg aat gcg cca tat cca caa tgg ctg ate aag att 



825 
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Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 250 255 

ttg ttc tac tac atg ate teg ttg ctg ttt ctt ttc ggc aat ttt tac 873 
Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 

gta caa aaa tac ate aaa ccc tct gac gga aag caa aag gga get aaa 921 
Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 

act gag tga gctgtatcaa gecatagaaa ctctattatg ttagaacctg 970 
Thr Glu 
290 

aagttggtgc tttcttatct ccacttatct tttaagcagc atcagttttg aaatgatgtg 1030 

tgggcgtggt ctgeaagtag tcatcaatat aatcggcctg agcacttcag atggattgtt 1090 

agaacatgag taaaagcggt tattaeggtg tttattttgt accaaatcac cgcacgggtg 1150 
aattgaaata tttcagattt gatcaatttc atctgaaaaa aa 1192 



<210> 4 
<211> 290 
<212> PKT 

<213> Physcomitrella patens 
<400> 4 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
15 10 15 

Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro He 
35 40 45 



Val Leu Gly Val Ser Val Tyr Leu Thr He Val lie Gly Gly Leu Leu 
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50 55 60 < 

Trp lie Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 

Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 

85 90 95 

Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala lie Thr Trp Arg Tyr 
100 105 110 

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala lie 
115 120 125 

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

Val lie Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 
145 150 155 160 

Val Tyr His His Ser Ser He Ser Leu He Trp Trp Ala He Ala His 
165 170 175 

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 

Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 230 235 240 

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu He Lys He 
245 250 255 



Leu Phe Tyr Tyr Met He Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 
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Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 

Thr Glu 
290 



<210> 5 
<211> 1054 
<212> DNA 

<213> Thraustochytrium 

<220> 

<2 21> CDS 

<222> (43). -(858) 

<223> A6-elongase 

<400> 5 

gaattcggca cgagagcgcg cggagcggag acctcggccg eg atg atg gag ccg 54 

Met Met Glu Pro ' 

- — - i 

etc gac agg tac agg gcg ctg gcg gag etc gee gcg agg tac gec age -102* 
Leu Asp Arg Tyr Arg Ala Leu Ala Glu Leu Ala Ala Arg Tyr Ala Ser 
5 10 15 20 

teg gcg gee ttc aag tgg caa gtc acg tac gac gec aag gac age ttc 150 
Ser Ala Ala Phe Lys Trp Gin Val Thr Tyr Asp Ala Lys Asp Ser Phe 

25 30 35 

gtc ggg ccc ctg gga ate egg gag ccg etc ggg etc ctg gtg ggc tec 198 
Val Gly Pro Leu Gly lie Arg Glu Pro Leu Gly Leu Leu Val Gly Ser 
40 45 50 

gtg gtc etc tac ctg age ctg ctg gee gtg gtc tac gcg ctg egg aac 2 46 
Val Val Leu Tyr Leu Ser Leu Leu Ala Val Val Tyr Ala Leu Arg Asn 
55 60 65 



tac ctt ggc ggc etc atg gcg etc cgc age gtg cat aac etc ggg etc 294 
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Tyr Leu Gly 
70 

tgc etc ttc 

Cys Leu Phe 
B5 

cag gat ggg 

Gin Asp Gly 

aag cat ccg 
Lys His Pro 

ate tgg gag 
lie Trp Glu 
135 

etc cgc ttc 
Leu Arg Phe 
150 

gec ate gac 
Ala lie Asp 
165 

aat get ttc 
Asn Ala Phe 



ccg aag ggc 
Pro Lys Gly 

att ttc age 
lie Phe Ser 
215 



Gly Leu Met Ala 
75 

teg ggc gee gtg 
Ser Gly Ala Val 
90 

cac ttt cgc age 
His Phe Arg Ser 
105 

cac ttc cag etc 
His Phe Gin Leu 
120 

tgg ttc gac acg 
Trp Phe Asp Thr 



ctg cac gtc ttg 
Leu His Val Leu 
155 

cac ate ttt etc 
His lie Phe Leu 
170 

ate cac ace gtc 
lie His Thr Val 
185 

ttg cgc ccg ctt 
Leu Arg Pro Leu 
200 

ate ggc ate cat 
He Gly He His 



12 

Leu Arg Ser Val 

tgg ate tac acg 
Trp He Tyr Thr 
95 

etc gag gcg gca 
Leu Glu Ala Ala 
110 

ate age ttg etc 
He Ser Leu Leu 
125 

gtg etc etc ate 
Val Leu Leu He 
140 

cac cae gee acg 
His His Ala Thr 



teg tec ate aag 
Ser Ser lie Lys 
175 

atg tac gcg cac 
Met Tyr Ala His 
190 

att acg cag ttg 
He Thr Gin Leu 
205 

ace gee att tac 
Thr Ala He Tyr 
220 



His Ash Leu 
80 

age tac etc 
Ser Tyr Leu 

acg tgc gag 
Thr Cys Glu 



ttt gcg ctg 
Phe Ala Leu 
130 

gtc aag ggc 
Val Lys Gly 
145 

acc ttt tgg 
Thr Phe Trp 
160 

tac ggc gtc 
Tyr Gly Val 

tac ttc cgc 
Tyr Phe Arg 



cag ate gtc 
Gin He Val 
210 

tgg cac tac 
Trp His Tyr 
225 



Gly Leu 



atg ate 342 
Met He 
100 

ccg etc 390 

Pro Leu 

115 

tec aag 438 
Ser Lys 

aac aag 486 
Asn Lys 

etc tac 534 
Leu Tyr 

gcg gtc 582 
Ala Val 
180 

cca ttc 630 

Pro Phe 

195 

cag ttc 6 78 
Gin Phe 



gac tgc 726 
Asp Cys 



gag ccg etc gtg cat acc cac ttt tgg gaa tac gtc acg ccc tac ctt 



774 
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Glu Pro Leu Val His Thr His Phe Trp Glu Tyr Val Thr Pro Tyr Leu 
230 235 240 

ttc gtc gtg ccc ttc etc ate etc ttt ttc aat ttt tac ctg cag cag 822 
Phe Val Val Pro Phe Leu He Leu Phe Phe Asn Phe Tyr Leu Gin Gin 
245 250 255 260 

tac gtc etc gcg ccc gca aaa acc aag aag gca tag ccacgtaaca 868 
Tyr Val Leu Ala Pro Ala Lys Thr Lys Lys Ala 
265 270 

gtagaccagc agegecgagg acgcgtgccg cgttatcgcg aagcacgaaa taaagaagat 928 

catttgattc aacgaggcta cttgcggcca cgagaaaaaa aaaaaaaaaa aaaaaaaaaa 988 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1048 

ctcgag 1054 



<210> 6 
<211> 271 
<212> PRT 

<213> Thraustochytrium 
<400> 6 

Met Met Glu Pro Leu Asp Arg Tyr Arg Ala Leu Ala Glu Leu Ala Ala 
! 5 10 15 

Arg Tyr Ala Ser Ser Ala Ala Phe Lys Trp Gin Val Thr Tyr Asp Ala 
20 25 30 

Lys Asp Ser Phe Val Gly Pro Leu Gly He Arg Glu Pro Leu Gly Leu 
35 40 45 

Leu Val Gly Ser Val Val Leu Tyr Leu Ser Leu Leu Ala Val Val Tyr 
50 55 60 



Ala Leu Arg Asn Tyr Leu Gly Gly Leu Met Ala Leu Arg Ser Val His 
65 70 75 80 
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Asn Leu Gly Leu Cys Leu Phe Ser Gly Ala Val Trp lie Tyr Thr Ser 

85 90 95 

Tyr Leu Met lie Gin Asp Gly His Phe Arg Ser Leu Glu Ala Ala Thr 
100 105 HO 

Cys Glu Pro Leu Lys His Pro His Phe Gin Leu He Ser Leu Leu Phe 
115 120 125 

Ala Leu ser Lys lie Trp Glu Trp Phe Asp Thr Val Leu Leu lie Val 
130 135 140 

Lys Gly Asn Lys Leu Arg Phe Leu His Val Leu His His Ala Thr Thr 
145 150 155 ' i 6 o 

Phe Trp Leu Tyr Ala He Asp His He Phe Leu Ser Ser He Lys Tyr 
165 170 175 

Gly val Ala Val Asn Ala Phe lie His Thr val Met Tyr Ala His Tyr 
180 185 190 

Phe Arg Pro Phe Pro Lys Gly Leu Arg Pro Leu He Thr Gin Leu Gin 
195 200 205 

He val Gin Phe He Phe Ser He Gly He His Thr Ala He Tyr Trp 
210 215 220 

His Tyr Asp Cys Glu Pro Leu Val His Thr His Phe Trp Glu Tyr Val 
225 230 235 240 

Thr Pro Tyr Leu Phe Val Val Pro Phe Leu He Leu Phe Phe Asn Phe 
245 250 255 

Tyr Leu Gin Gin Tyr Val Leu Ala Pro Ala Lys Thr Lys Lys Ala 
260 265 270 



<210> 
<211> 



7 

2040 
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<212> DNA 

<213> Ceratodon purpureus 

<220> 
<221> CDS 

<222> (176).. (1627) 
<223> A6-desaturase 

<400> 7 

ctcaggcagg tctcagttga tgagacgctg agttctgaat cctttgagct gtgtcaggct 60 

cggcacttgt gggatggtga aggagtgatc gatcaggagt gcaggagctg cattagtttc 120 

tcagggtcga tcaggttatt ctgaaaaagg ctgcgtctgt gagcagtttg caaaa atg 178 

Met 
l 

gcc etc gtt acc gac ttt ctg aac ttt ctg ggc acg aca tgg age aag 22 6 
Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr Trp Ser Lys 
5 10 15 

tac age gtg tac acc cat age tat get gga aac tat ggg cct act ttg 274 
Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly Pro Thr Leu 
20 25 30 

aag cac gcc aaa aag gtt tct get caa ggt aaa act gcg gga cag aca 322 
Lys His Ala Lys Lys Val Ser Ala Gin Gly Lys Thr Ala Gly Gin Thr 
35 40 45 

ctg aga cag aga teg gtg cag gac aaa aag cca ggc act tac tct ctg 37 0 
Leu Arg Gin Arg Ser Val Gin Asp Lys Lys Pro Gly Thr Tyr Ser Leu 
50 55 60 65 

gcc gat gtt get tct cac gac agg cct gga gac tgc tgg atg ate gtc 418 
Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp Met lie Val 

70 75 80 



aaa gag aag gtg tat gat att age cgt ttt gcg gac gac cac cct gga 
Lys Glu Lys Val Tyr Asp He Ser Arg Phe Ala Asp Asp His Pro Gly 
85 90 95 



466 
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ggg acg gta att age acc tac ttt ggg egg gat ggc aca gac gtt ttc 514 
Gly Thr Val lie Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe 
100 105 no 

gca aca ttc cat cca cct gee gca tgg aag caa etc aat gac tac tac 562 
Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gin Leu Asn Asp Tyr Tyr 
115 120 125 



att gga gac ctt get agg gaa gag ccc ctt gat gaa ttg ctt aaa gac 610 
lie Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu Leu Lys Asp 

145 



130 135 140 



tac aga gat atg aga gec gag ttt gtt aga gaa ggg ctt ttc aag agt 658 
Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu Phe Lys Ser 
150 155 160 

tec aag gec tgg ttc ctg ctt cag act ctg att aat gca get etc ttt 706 
Ser Lys Ala Trp Phe Leu Leu Gin Thr Leu lie Asn Ala Ala Leu Phe 
165 170 175 

get gcg age att gcg act ate tgt tac gac aag agt tac tgg get att 754 
Ala Ala ser lie Ala Thr He Cys Tyr Asp Lys Ser Tyr Trp Ala He 
180 185 190 

gtg ctg tea gee agt ttg atg ggt etc ttc gtc caa cag tgt gga tgg 802 
Val Leu ser Ala Ser Leu Met Gly Leu Phe Val Gin Gin Cys Gly Trp 
195 200 205 

ctt gec cat gat ttc ctt cat caa cag gtc ttt gag aac cgt acc gcg 850 
Leu Ala His Asp Phe Leu His Gin Gin val Phe Glu Asn Arg Thr Ala 
210 2 15 220 " 225 

aac tec ttc ttt ggc tat ttg ttc ggc aat tgc gtg ctt ggc ttt agt 898 
Asn ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe Ser 
230 235 240 

gta tea tgg tgg agg acg aag cac aac att cat cat act get ccg aat 946 
Val ser Trp Trp Arg Thr Lys His Asn He His His Thr Ala Pro Asn 
245 250 255 



0093/00064 



CA 02485060 2004-10-28 



17 

gag tgc gac gaa cag tac aca cert eta gac gaa gac att gat act etc 994 

Glu Cys Asp Glu Gin Tyr Thr Pro Leu Asp Glu Asp lie Asp Thr Leu 

260 265 270 

ccc ate att gee tgg age aag gaa att ttg gec acc gtt gag age aag 1042 
Pro lie lie Ala Trp Ser Lys Glu lie Leu Ala Thr Val Glu Ser Lys 
275 280 285 

aga att ttg cga gtg ctt caa tat cag cac tac atg att ctg cct eta 1090 
Arg He Leu Arg Val Leu Gin Tyr Gin Kis Tyr Met He Leu Pro Leu 
290 295 300 305 

ttg ttc atg gee egg tac agt tgg act ttt gga agt ttg etc ttc aca 113 8 
Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe Thr 
310 315 - 320 

ttc aat cct gat ttg age acgacc aag gga ttg ata gag aag gga aca 1186 
Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu He Glu Lys Gly Thr 
325 330 335 

gtt get ttt cac tac gee tgg ttc agt tgg get gcg ttc cat att ttg 1234 
Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His He Leu 
340 345 . 350 

ccg ggt gtc get aag cct ctt gcg tgg atg gta gca act gag ctt gtg 12 82 
Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu Val 
355 360 365 

gec ggt ttg ttg ttg gga ttc gtg ttt acg ttg agt cac aat gga aag 1330 
Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly Lys 
370 375 380 385 

gag gtt tac aat gaa teg aag gac ttc gtg aga gec cag gtt att acc 13 7 8 
Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gin Val lie Thr 
390 395 400 

acc cgt aac acc aag cga ggc tgg ttc aac gat tgg ttc act ggg gga 14 2 6 
Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly Gly 
405 410 415 
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at t aaa cat cac ctg ttt cca aca atg ccc agg cac 1474 

- z z « - - - - - - ~ - - 

420 425 

* ccc aag ate gca cct cag gtc gag get ctt tgc aag aag cac 1522 
aac tac ccc aag ate g ^ ^ ^ cys Lyg Lys His 

Asn Tyr Pro Lys lie AJ.a 

435 440 

t «e aat aat gtc tec gtc gtt ggt gec tet gtc gcg gtt 1570 
ggc etc gag tac gat aat g ^ ^ ^ ^ 

Gly Leu Glu Tyr Asp Asn Val Ser Val va y ^ 

455 

occ etc aag gaa att get gat gaa gcg tea att egg ctt cac 1618 
Z Z Z Z LyI L He A!a As P 1- - Se, He A,g £ -i. 



470 475 



1667 



, et esc »» .»c«t 9 .c t .«c. t «« «c 9 c= t9 ,c 

Ala His 

„aotacttqq cagtgtgttc aaceggageg cactgaaaat 1727 
tacctcaaat gttegggage aggtgcttgg cag g y 

gt9 c. 9 *«c ««ttc.M. — ~" """"" """"" 1,87 

««««« ...^ 1647 
«c„ 9 c»« c, t a« 9 ,c g 9 « 9 cc g a,a ,30, 

„ aCT + a a atctagtgta aatttctatt tccgcatgta 1967 
gtttacatta ttagttggca caacgagtag atctagtg 

* rcttitctat tttcctaaaa aaaaaaaaaa aaaaaaaaaa 2027 
atattactct gaatatatac cgttatctar 

2040 

aaaaaaaaaa aaa 



<210> 8 
<211> 483 
<212> PRT 

<213> ceratodon purpureus 
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<400> 8 

Met Ala Leu Val 
1 

Lys Tyr Ser Val 
20 

Leu Lys His Ala 
35 

Thr Leu Arg Gin 
50 

Leu Ala Asp Val 
65 

Val Lys Glu Lys 



Gly Gly Thr Val 

. 100 

Phe Ala Thr Phe 
115 

Tyr lie Gly Asp 
130 

Asp Tyr Arg Asp 
145 

Ser Ser Lys Ala 

Phe Ala Ala Ser 
180 



Thr Asp Phe Leu 
5 

Tyr Thr His Ser 



Lys Lys Val Ser 
40 

Arg Ser Val Gin 
55 

Ala Ser His Asp 
70 

Val Tyr Asp lie 
85 

lie Ser Thr Tyr 



His Pro Pro Ala 
120 

Leu Ala Arg Glu 
135 

Met Arg Ala Glu 
150 

Trp Phe Leu Leu 
165 

lie Ala Thr He 



19 

Asn Phe Leu Gly 
10 

Tyr Ala Gly Asn 
25 

Ala Gin Gly Lys 



Asp Lys Lys Pro 
60 

Arg Pro Gly Asp 
75 

Ser Arg Phe Ala 
- 90 

Phe Gly Arg Asp 

105 

Ala Trp Lys Gin 



Glu Pro Leu Asp 
140 

Phe Val Arg Glu 
155 

Gin Thr Leu He 
170 

Cys Tyr Asp Lys 
185 



Thr Thr Trp Ser 
15 

Tyr Gly Pro Thr 
30 

Thr Ala Gly Gin 
45 

Gly Thr Tyr Ser 



Cys Trp Met He 
80 

Asp Asp His Pro 

95 . \ 

Gly Thr Asp Val ' 
110 

Leu Asn Asp Tyr 
125 

Glu Leu Leu Lys 



Gly Leu Phe Lys 
160 

Asn Ala Ala Leu 
175 

Ser Tyr Trp Ala 
190 



He Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gin Gin Cys Gly 
195 200 205 
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Trp Leu Ala His Asp Phe Leu His Gin Gin Val Phe Glu Asn Arg Thr 
210 215 220 

Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe 
225 230 235 240 

Ser Val Ser Trp Trp Arg Thr Lys His Asn lie His His Thr Ala Pro 
245 250 255 

Asn Glu Cys Asp Glu Gin Tyr Thr Pro Leu Asp Glu Asp He Asp Thr 
260 265 270 

Leu Pro He He Ala Trp Ser Lys Glu He Leu Ala Thr Val Glu Ser 
275 280 285 

Lys Arg He Leu Arg Val Leu Gin Tyr Gin His Tyr Met lie Leu Pro 
290 295 300 

Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe 
305 310 315 320 

Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu He Glu Lys Gly 
325 330 335 

Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His He 
340 345 350 

Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu 
355 360 365 

Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly 
370 375 380 

Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gin Val lie 
385 390 395 400 

Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly 
405 410 415 



Gly Leu Asp Thr Gin He Glu His His Leu Phe Pro Thr Met Pro Arg 
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420 425 430 

His Asn Tyr Pro Lys lie Ala Pro Gin Val Glu Ala Leu Cys Lys Lys 
435 440 445 

His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser Val Ala 
450 455 460 

Val Val Lys Ala Leu Lys Glu lie Ala Asp Glu Ala Ser lie Arg Leu 
465 470 475 480 

His Ala His 



<210> 9 . 
<211> 1467 

<212> DNA " - 

<213> Ceratodon purpureus 

<220> 

<221> CDS 

<222> (10) < 1461) 

<223> A6-desaturase 

<400> 9 

ggatccaaa atg gcc etc gtt acc gac ttt ctg aac ttt ctg ggc acg aca 51 

Met Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr 
1 5 10 

tgg age aag tac age gtg tac acc cat age tat get gga aac tat ggg 99 
Trp Ser Lys Tyr Ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly. 
15 20 25 30 

cct act ttg aag cac gcc aaa aag gtt tct get caa ggt aaa act gcg 147 
Pro Thr Leu Lys His Ala Lys Lys Val Ser Ala Gin Gly Lys Thr Ala 

35 40 45 

gga cag aca ctg aga cag aga teg gtg cag gac aaa aag cca ggc act 195 
Gly Gin Thr Leu Arg Gin Arg Ser Val Gin Asp Lys Lys Pro Gly Thr 
50 55 60 
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tac tct ctg gcc gat gtt get tct cac gac agg cct gga gac tgc tgg 24 3 

Tyr Ser Leu Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp 

65 70 75 

atg ate gtc aaa gag aag gtg tat gat att age cgt ttt gcg gac gac 291 

Met lie Val Lys Glu Lys Val Tyr Asp lie Ser Arg Phe Ala Asp Asp 
80 85 90 

cac cct gga ggg acg gta att age ace tac ttt ggg egg gat ggc aca 33 9 

His Pro Gly Gly Thr Val lie Ser Thr Tyr Phe Gly Arg Asp Gly Thr 

95 100 105 110 

gac gtt ttc gca aca ttc cat cca cct gcc gca tgg aag caa etc aat 3S7 

Asp Val Phe Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gin Leu Asn 

115 120 125 

gac tac tac att gga gac ctt get agg gaa gag ccc ctt gat gaa ttg 435 

Asp Tyr Tyr lie Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu 

130 135 140 

ctt aaa gac tac aga gat atg aga gcc gag ttt gtt aga gaa ggg ctt 4 83 

Leu Lys Asp Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu 

145 150 - 155 

ttc aag agt tec aag gcc tgg ttc ctg ctt cag act ctg att aat gca 531 

Phe Lys Ser Ser Lys Ala Trp Phe Leu Leu Gin Thr Leu lie Asn Ala 
160 165 170 

get etc ttt get gcg age att gcg act ate tgt tac gac aag agt tac 579 

Ala Leu Phe Ala Ala Ser lie Ala Thr lie Cys Tyr Asp Lys Ser Tyr 

175 180 185 190 

tgg get att gtg ctg tea gcc agt ttg atg ggt etc ttc gtc caa cag 6 27 

Trp Ala lie Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gin Gin 

195 200 205 



tgt gga tgg ctt gcc cat gat ttc ctt cat caa cag gtc ttt gag aac 
Cys Gly Trp Leu Ala His Asp Phe Leu His Gin Gin Val Phe Glu Asn 
210 215 220 



675 
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cgt acc gcg aac tec ttc ttt ggc tat ttg ttc ggc aat tgc gtg ctt 723 
Arg Thr Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu 
225 230 235 

ggc ttt agt gta tea tgg tgg agg acg aag cac aac att cat cat act 771 
Gly Phe Ser Val Ser Trp Trp Arg Thr Lys His Asn lie His His Thr 
240 245 250 

get ccg aat gag tgc gac gaa cag tac aca cct eta gac gaa gac att 819 
Ala Pro Asn Glu Cys. Asp Glu Gin Tyr Thr Pro Leu Asp Glu Asp lie 
255 260 265 270 

gat act etc ccc ate att gec tgg age aag gaa att ttg gec acc gtt 8 67 
Asp Thr Leu Pro lie lie Ala Trp Ser Lys Glu lie Leu Ala Thr Val 
275 280 285 

gag age aag aga att ttg cga gtg ctt caa tat cag cac tac atg att 915 
Glu Ser Lys Arg lie Leu Arg Val Leu Gin Tyr Gin His Tyr Met lie 
290 . 295 300 

ctg cct eta ttg ttc atg gec egg tac agt tgg act ttt gga agt ttg 9 63 
Leu Pro Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu 
305 310 . 315 

etc ttc aca ttc aat cct gat ttg age acg acc aag gga ttg ata gag 1011 
Leu Phe Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu lie Glu 
320 325 330 

aag gga aca gtt get ttt cac tac gee tgg ttc agt tgg get gcg ttc 1059 
Lys Gly Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe 
335 340 345 350 



cat att ttg ccg ggt gtc get aag cct ctt gcg tgg atg gta gca act 1107 
His lie Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr 
355 360 365 



gag ctt gtg gec ggt ttg ttg ttg gga ttc gtg ttt acg ttg agt cac 
Glu Leu Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His 
370 375 380 



1155 
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egg ctt cac get cac taa gtcgac 
Arg Leu His Ala His 
480 



1203 



aat gga aag gag gtt tac aat gaa teg aag gac ttc gtg aga gee cag 
Asn Gly Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gin 
385 390 395 

gtt att aec ace cgt aac ace aag cga ggc tgg ttc aac gat tgg ttc 1251 
Val He Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe 
400 405 410 

act ggg gga etc gac aec cag att gag cat cac ctg ttt cca aca atg 1299 
Thr Gly Gly Leu Asp Thr Gin He Glu His His Leu Phe Pro Thr Met 
415 420 425 430 

ccc agg cac aac tac ccc aag ate gca cct cag gtc gag get ctt tgc 1347 
Pro Arg His Asn Tyr Pro Lys He Ala Pro Gin Val Glu Ala Leu Cys 
435 440 445 

aag aag cac ggc etc gag tac gat aat gtc tec gtc gtt ggt gec tct 13 9 5 
Lys Lys His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser 
450 455 460 

gtc gcg gtt gtg aag gcg etc aag gaa att get gat gaa gcg tea att 1443 
Val Ala Val val Lys Ala Leu Lys Glu He Ala Asp Glu Ala Ser He 
465 470 - 475 



1467 



<210> 10 
<211> 483 
<212> PRT 

<213> Ceratodon purpureus 
<400> 10 

Met Ala Leu Val Thr Asp Phe Leu Asn Phe Leu Gly Thr Thr Trp Ser 
1 5 10 15 



Ly 



s Tyr ser Val Tyr Thr His Ser Tyr Ala Gly Asn Tyr Gly Pro Thr 
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20 25 '30 

Leu Lys His Ala Lys Lys Val Ser Ala Gin Gly Lys Thr Ala Gly Gin 
35 40 45 

Thr Leu Arg Gin Arg Ser Val Gin Asp Lys Lys Pro Gly Thr Tyr Ser 
50 55 60 

Leu Ala Asp Val Ala Ser His Asp Arg Pro Gly Asp Cys Trp Met He 
65 70 75 80 

Val Lys Glu Lys Val Tyr Asp He Ser Arg Phe Ala Asp Asp His Pro 

85 90 95 

Gly Gly Thr Val lie Ser Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val 
100 105 110 

Phe Ala Thr Phe His Pro Pro Ala Ala Trp Lys Gin Leu Asn Asp Tyr 
115 120 125 

Tyr He Gly Asp Leu Ala Arg Glu Glu Pro Leu Asp Glu Leu Leu Lys 
130 " 135' ■ 140 

Asp Tyr Arg Asp Met Arg Ala Glu Phe Val Arg Glu Gly Leu Phe Lys 
145 150 155 160 

Ser Ser Lys Ala Trp Phe Leu Leu Gin Thr Leu He Asn Ala Ala Leu 
165 170 175 

Phe Ala Ala Ser He Ala Thr He Cys Tyr Asp Lys Ser Tyr Trp Ala 
180 185 190 

He Val Leu Ser Ala Ser Leu Met Gly Leu Phe Val Gin Gin Cys Gly 
195 200 205 

Trp Leu Ala His Asp Phe Leu His Gin Gin Val Phe Glu Asn Arg Thr 
210 215 220 



Ala Asn Ser Phe Phe Gly Tyr Leu Phe Gly Asn Cys Val Leu Gly Phe 
225 230 235 240 
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Ser Val Ser Trp Trp Arg Thr Lys His Asn lie His His Thr Ala Pro 
245 250 255 

Asn Glu Cys Asp Glu Gin Tyr Thr Pro Leu Asp Glu Asp He Asp Thr 
260 265 270 

Leu Pro He He Ala Trp Ser Lys Glu He Leu Ala Thr Val Glu Ser 
275 280 285 

Lys Arg He Leu Arg Val Leu Gin Tyr Gin His Tyr Met He Leu Pro 
290 295 300 

Leu Leu Phe Met Ala Arg Tyr Ser Trp Thr Phe Gly Ser Leu Leu Phe 
305 310 315 ' 320 

Thr Phe Asn Pro Asp Leu Ser Thr Thr Lys Gly Leu He Glu Lys Gly 
325 330 335 

Thr Val Ala Phe His Tyr Ala Trp Phe Ser Trp Ala Ala Phe His He 
340 345 350 

Leu Pro Gly Val Ala Lys Pro Leu Ala Trp Met Val Ala Thr Glu Leu 
355 360 365 

Val Ala Gly Leu Leu Leu Gly Phe Val Phe Thr Leu Ser His Asn Gly 
370 375 380 

Lys Glu Val Tyr Asn Glu Ser Lys Asp Phe Val Arg Ala Gin Val He 
385 390 395 400 

Thr Thr Arg Asn Thr Lys Arg Gly Trp Phe Asn Asp Trp Phe Thr Gly 
405 410 415 

Gly Leu Asp Thr Gin He Glu His His Leu Phe Pro Thr Met Pro Arg 
420 425 430 



His Asn Tyr Pro Lys He Ala Pro Gin Val Glu Ala Leu Cys Lys Lys 
435 440 445 
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His Gly Leu Glu Tyr Asp Asn Val Ser Val Val Gly Ala Ser Val Ala 
450 455 460 

Val Val Lys Ala Leu Lys Glu lie Ala Asp Glu Ala Ser lie Arg Leu 
465 470 475 480 

His Ala His 



<210> 11 
<211> 2160 
<212> DNA 

<213> Ceratodon purpureus 

<220> 

<2 21> CDS 

<222> (159) (1721) 
<223> A6-desaturase 

<400> 11 

cggaggtctc ttgtcgttct tggagtctgt. gtcgagcttg gaatgcggta ggcgcggccg 60 

tttcgtggtt ttggcgttgg cattgcgcga gggcggacag tgggagtgcg ggaggtctgt 120 

ttgtgcatga cgaggtggtt gtaatcttcg ccggcaga atg gtg tec cag ggc ggc 176 

Met Val Ser Gin Gly Gly 
1 5 

ggt etc teg cag ggt tec att gaa gaa aac att gac gtt gag cac ttg 224 
Gly Leu Ser Gin Gly Ser lie Glu Glu Asn He Asp Val Glu His Leu 
10 15 20 

gca acg atg ccc etc gtc agt gac ttc eta aat gtc ctg gga acg act 27 2 
Ala Thr Met Pro Leu Val Ser Asp Phe Leu Asn Val Leu Gly Thr Thr 
25 30 35 

ttg ggc cag tgg agt ctt tec act aca ttc get ttc aag agg etc acg 320 
Leu Gly Gin Trp Ser Leu Ser Thr Thr Phe Ala Phe Lys Arg Leu Thr 
40 45 50 
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act aag aaa cac agt teg gac ate teg gtg gag gca caa aaa gaa teg 368 
Thr Lys Lys His Ser Ser Asp He Ser Val Glu Ala Gin Lys Glu Ser 
55 60 65 70 

gtt gcg egg ggg cca gtt gag aat att tct caa teg gtt gcg cag ccc 416 
val Ala Arg Gly Pro Val Glu Asn He Ser Gin Ser val Ala Gin Pro 
75 80 85 

ate agg egg agg tgg gtg cag gat aaa aag ccg gtt act tac age ctg 464 
He Arg Arg Arg Trp Val Gin Asp Lys Lys Pro Val Thr Tyr Ser Leu 
90 95 ioo 

aag gat gta get teg cac gat atg ccc cag gac tgc tgg att ata ate 512 
Lys Asp Val Ala Ser His Asp Met Pro Gin Asp Cys Trp He He He 
105 HO us 

aaa gag aag gtg tat gat gtg age acc ttc get gag cag cac cct gga 560 
Lys Glu Lys Val Tyr Asp Val Ser Thr Phe Ala Glu Gin His Pro Gly 
"0 125 130 

ggc acg gtt ate aac acc tac ttc gga cga gac gee aca gat gtt ttc 608 
Gly Thr Val He Asn Thr Tyr Phe Gly Arg Asp Ala Thr Asp Val Phe 
135 "0 145 150 

tct act ttc cac gca tec acc tea tgg aag att ctt cag aat ttc tac 656 
Ser Thr Phe His Ala Ser Thr Ser Trp Lys He Leu Gin Asn Phe Tyr 
155 160 1 65 

ate ggg aac ctt gtt agg gag gag ccg act ttg gag ctg ctg aag gag 704 
He Gly Asn Leu Val Arg Glu Glu Pro Thr Leu Glu Leu Leu Lys Glu 
170 175 180 

tac aga gag ttg aga gee ctt ttc ttg aga gaa cag ctt ttc aag agt 752 
Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser 
185 190 195 

tec aaa tec tac tac ctt ttc aag act etc ata aat gtt tec att gtt 800 
Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu He Asn Val Ser He Val 
200 20S 210 
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gcc aca age att gcg ata ate agt ctg tac aag tct tab egg gcg gtt 848 

Ala Thr Ser lie Ala lie lie Ser Leu Tyr Lys Ser Tyr Arg Ala Val 
215 220 225 230 

ctg tta tea gcc agt ttg atg ggc ttg ttt att caa cag tgc gga tgg B96 

Leu Leu Ser Ala Ser Leu Met Gly Leu Phe lie Gin Gin Cys Gly Trp 
235 240 245 

ttg tct cac gat ttt eta cac cat cag gta ttt gag aca cgc tgg etc 944 

Leu Ser His Asp Phe Leu His His Gin Val Phe Glu Thr Arg Trp Leu 
250 255 260 

aat gac gtt gtt ggc tat gtg gtc ggc aac gtt gtt ctg gga ttc agt 9 92 

Asn Asp Val Val Gly Tyr Val Val Gly Asn Val Val Leu Gly phe Ser 

265 270 275 

gtc teg tgg tgg aag ace aag cac aac ctg cat cat get get ccg aat 1040 

Val Ser Trp Trp Lys Thr Lys His Asn Leu His His Ala Ala Pro Asn 
280 285 290 

gaa tgc gac caa aag tac aca ccg att gat gag gat att gat act etc 1088 

Glu Cys Asp Gin Lys Tyr Thr Pro lie Asp Glu Asp lie Asp Thr Leu" 
295 300 305 310 

ccc ate att get tgg agt aaa gat etc ttg gcc act gtt gag age aag 1136 

Pro He lie Ala Trp Ser Lys Asp Leu Leu Ala Thr Val Glu Ser Lys 
315 320 325 

acc atg ttg cga gtt ctt cag tac cag cac eta ttc ttt ttg gtt ctt 1184 

Thr Met Leu Arg Val Leu Gin Tyr Gin His Leu Phe Phe Leu Val Leu 
330 335 340 

ttg acg ttt gcc egg gcg agt tgg eta ttt tgg age gcg gcc ttc act 1232 

Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe Trp Ser Ala Ala Phe Thr 

345 350 355 

etc agg ccc gag ttg acc ctt ggc gag aag ctt ttg gag agg gga acg 1280 

Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys Leu Leu Glu Arg Gly Thr 
360 365 370 
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atg get ttg cac tac att tgg ttt aat agt gtt gcg ttt tat ctg etc 1328 

Met Ala Leu His Tyr lie Trp Phe Asn Ser Val Ala Phe Tyr Leu Leu 
375 380 385 390 

ccc gga tgg aaa cca gtt gta tgg atg gtg gtc age gag etc atg tct 1376 
Pro Gly Trp Lys Pro Val Val Trp Met Val Val Ser Glu Leu Met Ser 
395 400 405 

ggt ttc ctg ctg gga tac gta ttt gta etc agt cac aat gga atg gag 1424 
Gly Phe Leu Leu Gly Tyr Val Phe Val Leu Ser His Asn Gly Met Glu 
410 415 420 



gtg tac aat acg tea aag gac ttc gtg aat gec cag att gca teg act 
Val Tyr Asn Thr Ser Lys Asp Phe Val Asn Ala Gin lie Ala Ser Thr 
425 430 435 

cgc gac ate aaa gca ggg gtg ttt aat gat tgg ttc acc gga ggt etc 
Arg Asp lie Lys Ala Gly Val Phe Asn Asp Trp Phe Thr Gly Gly Leu 
440 445 450 

aac aga cag att gag cat cat eta ttt cca acg atg ccc agg cac aac 
Asn Arg Gin He Glu His His Leu Phe Pro Thr Met Pro Arg His Asn 
455 460 465 470 

ctt aat aaa att tct cct cac gtg gag act ttg tgc aag aag cat gga 
Leu Asn Lys He Ser Pro His Val Glu Thr Leu Cys Lys Lys His Gly 
475 480 485 

ctg gtc tac gaa gac gtg age atg get teg ggc act tac egg gtt ttg 
Leu Val Tyr Glu Asp Val Ser Met Ala Ser Gly Thr Tyr Arg Val Leu 
490 495 500 

aaa aca ctt aag gac gtt gec gat get get tea cac cag cag ctt get 
Lys Thr Leu Lys Asp Val Ala Asp Ala Ala Ser His Gin Gin Leu Ala 
505 510 515 



1472 



1520 



1568 



1616 



1664 



1712 



gcg agt tga ggcatcgcag cactcgtcga aacatttttg tctgttatag 
Ala Ser 
520 



1761 
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tgttcatatg tgatcgaggg gaaaaggtcc catgctctga tctatt'cttc tgtagccaat 1821 

atttttcaat tgaaaggagg ttcctcactt atcttccatc tatcgttgca catcctgcat 1881 

cagagttagc gttggagtaa tgttaagcac ttgtagatta tgcccaccat tgccacattt 1941 

ctgttcggtt acaatcgttt gattccatgc tatcctccgt gtt.cat.ctcg ttgttataag 2 001 

caagcttgaa aaaacatgct acgagattgg cagacgttgt cttggcagct gtagaggttg 2061 

gttccattca ttgtgtagta cagaactctc tcgtccctgt ttctctacat tacttgttac 2121 

atagtgactt tcattcacag caaaaaaaaa aaaaaaaaa 2160 



<210> 12 
<211> 520 

<212> PRT - 
<213> Ceratodon purpureus 

<400> 12 .... - 

Met Val Ser Gin Gly Gly Gly Leu Ser Gin Gly Ser lie Glu Glu Asn 
1 5 10 15 

He Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu 
20 25 30 

Asn Val Leu Gly Thr Thr Leu Gly Gin Trp Ser Leu Ser Thr Thr Phe 
35 40 45 

Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp He Ser Val 
50 55 60 

Glu Ala Gin Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn He Ser 
65 70 75 80 

Gin Ser Val Ala Gin Pro He Arg Arg Arg Trp Val Gin Asp Lys Lys 

85 90 95 

Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gin 
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100 105 ' 110 

Asp cys Trp He He He Lys Glu Lys Val Tyr Asp Val ser Thr Phe 
H5 120 125 

Ala Glu Gin His Pro Gly Gly Thr Val He Asn Thr Tyr Phe Gly Arg 
130 135 140 

Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys 
145 150 155 160 

lie Leu Gin Asn Phe Tyr He Gly Asn Leu Val Arg Glu Glu Pro Thr 
165 170 175 

Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg 
180 185 190 

Glu Gin Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu 
195 200 205 

He Asn Val Ser He Val Ala Thr Ser He Ala He He Ser Leu Tyr 
210 215 220 

Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe 
225 230 235 240 

He Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His His Gin Val 
245 250 255 

Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn 
260 265 270 

Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu 
275 280 285 

His His Ala Ala Pro Asn Glu Cys Asp Gin Lys Tyr Thr Pro He Asp 
290 295 300 

Glu Asp He Asp Thr Leu Pro He He Ala Trp Ser Lys Asp Leu Leu 
305 310 315 320 
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Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gin Tyr Gin His 
325 330 335 

Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe 
340 345 350 

Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys 
355 360 365 

Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr lie Trp Phe Asn Ser 
370 375 380 

Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val 
385 390 395 400 

Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu 
4 05 410 415 

Ser His Asn Gly Met Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn 
420 : 425 . 430 

Ala Gin lie Ala Ser Thr Arg Asp lie Lys Ala Gly Val Phe Asn Asp 
435 440 . 445. 

Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu His His Leu Phe Pro 
450 455 460 

Thr Met Pro Arg His Asn Leu Asn Lys He Ser Pro His Val Glu Thr 
465 470 475 480 

Leu Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Met Ala Ser 
485 490 495 

Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala 
500 505 510 



Ser His Gin Gin Leu Ala Ala Ser 
515 520 
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<210> 13 

<211> 1434 

<212> DNA 

<213> Phaeodactylum 



34 



tricornutum 



<220> 

<221> CDS 

<222> (1)..<1434) 

<223> A6-desaturase 



<400> 13 

atg ggc aaa gga ggg gac get egg gec teg aag ggc tea acg gcg get 
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 
1 5 10 15 

cgc aag ate agt tgg cag gaa gtc aag acc cac gcg tct ccg gag gac 
Arg Lys He Ser Trp Gin Glu Val Lys Thr His Ala Ser Pro Glu Asp 
20 25 30 

gec tgg ate att cac tec aat aag gtc tac gac gtg tec aac tgg cac 
Ala Trp lie He His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His 
35 40 45 

gaa cat ccc gga ggc gec gtc att ttc acg cac gee ggt gac gac atg 
Glu His Pro Gly Gly Ala Val He Phe Thr His Ala Gly Asp Asp Met 
50 55 60 

acg gac att ttc get gee ttt cac gca ccc gga teg cag teg etc atg 
Thr Asp lie Phe Ala Ala Phe His Ala Pro Gly Ser Gin Ser Leu Met 
65 70 75 80 

aag aag ttc tac att ggc gaa ttg etc ccg gaa acc acc ggc aag gag 
Lys Lys Phe Tyr He Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu 

85 90 95 



48 



96 



144 



192 



240 



288 



ccg cag caa ate gec ttt gaa aag ggc tac cgc gat ctg cgc tec aaa 
Pro Gin Gin He Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys 
100 105 HO 



336 
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etc ate atg atg ggc atg ttc aag tec aac aag tgg ttc tac gtc tac 384 
Leu He Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr 
115 120 125 

aag tgc etc age aac atg gec att tgg gee gee gec tgt get etc gtc 432 
Lys Cys Leu Ser Asn Met Ala He Trp Ala Ala Ala Cys Ala Leu Val 
130 135 140 

ttt tac teg gac cgc ttc tgg gta cac ctg gec age gec gtc atg ctg 4 80 
Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu 
145 150 155 160 

gga aca ttc ttt cag cag teg gga tgg ttg gca cac gac ttt ctg cac 528 
Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

cac cag gtc ttc acc aag cgc aag cac ggg gat etc gga gga etc ttt 576 
His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe 
180 185 190 

tgg ggg aac etc atg cag ggt tac tec gta cag tgg tgg aaa aac aag 624 
Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp Lys Asn Lys 
195 200 205 

cac aac gga cac cac gec gtc ccc aac etc cac tgc tec tec gca gtc 672 
His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val 
210 215 220 .. 

gcg caa gat ggg gac ccg gac ate gat acc atg ccc ctt etc gec tgg 7 20 
Ala Gin Asp Gly Asp Pro Asp lie Asp Thr Met Pro Leu Leu Ala Trp 
225 230 235 240 

tec gtc cag caa gec cag tct tac egg gaa etc caa gec gac gga aag 7 68 
Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala Asp Gly Lys 
245 250 255 

gat teg ggt ttg gtc aag ttc atg ate cgt aac caa tec tac ttt tac 816 
Asp Ser Gly Leu Val Lys Phe Met He Arg Asn Gin Ser Tyr Phe Tyr 
260 265 270 
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ttt ccc ate 
Phe Pro lie 
275 

aag tgc gec 
Lys Cys Ala 
290 

etc aag gec 
Leu Lys Ala 
305 

ctg ctg cac 
Leu Leu His 



ttc teg ttc 
Phe Ser Phe 



tgt gga ttc 
Cys Gly Phe 
355 

gec acc tac 
Ala Thr Tyr 
370 

acc acg act 
Thr Thr Thr 
385 

gtc gac tgg 
Val Asp Trp 

ttc ccc age 
Phe Pro Ser 



ttg ttg etc 
Leu Leu Leu 



ttt ggg ctt 
Phe Gly Leu 

aag ggt ctt 
Lys Gly Leu 
310 

tac get tgg 
Tyr Ala Trp 
325 

gcg tac acc 

Ala Tyr Thr 
340 

ttg etc gee 

Leu Leu Ala 

aat gee gac 
Asn Ala Asp 

cgc aac gtc 
Arg Asn Val 
390 

ttc tgt ggt 
Phe Cys Gly 
405 

ctg ccc cga 
Leu Pro Arg 
420 



gec cgc ctg 
Ala Arg Leu 
280 

gga get gcg 
Gly Ala Ala 
295 

cag tac ccc 
Gin Tyr Pro 

atg ctt aca 
Met Leu Thr 



gca ttt tac 
Ala Phe. Tyr 
345 

att gtc ttt 
lie Val Phe 
360 

gec cgt ccg 
Ala Arg Pro 
375 

acg ggc gga 
Thr Gly Gly 



ggc etc cag 
Gly Leu Gin 

cac aat ctg 
His Asn Leu 
425 



36 

teg tgg ttg 
Ser Trp Leu 

teg gag aac 
Ser Glu Asn 
300 

ctt ttg gaa 
Leu Leu Glu 
315 

gtt teg tec 
Val Ser Ser 
330 

ttt eta acc 
Phe Leu Thr 



ggc etc ggc 
Gly Leu Gly 

gac ttc tgg 
Asp Phe Trp 
380 

cac ggt ttc 
His Gly Phe 
395 

tac caa gtc 
Tyr Gin Val 
410 

gec aag aca 
Ala Lys Thr 



aac gag tec 
Asn Glu Ser 
285 

get get etc 
Ala Ala Leu 



aag get ggc 
Lys Ala Gly 



ggc ttt gga 
Gly Phe Gly 
335 

gcg acc gcg 
Ala Thr Ala 
350 

cac aac ggc 
His Asn Gly 
365 

aag etc caa 
Lys Leu Gin 

ccc caa gec 
Pro Gin Ala 



gac cac cac 
Asp His His 
415 

cac gca ctg 
His Ala Leu 
430 



ttc 864 
Phe 



gaa 912 
Glu 



ate 960 

lie 

320 

cgc 1008 
Arg 

tec 1056 
Ser 



atg 1104 
Met 



gtc 1152 
Val 



ttt 1200 

Phe 

400 

tta 1248 
Leu 

gtc 1296 
Val 
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gaa teg ttc tgc aag gag 
Glu Ser Phe Cys Lys Glu 
435 

gtg gac ggg acc atg gaa 
Val Asp Gly Thr Met Glu 
450 

gaa ttc gtc gtg gat ttt 
Glu Phe Val Val Asp Phe 
465 470 



37 

tgg ggt gtc cag tac 
Trp Gly Val Gin Tyr 
440 

gtc ttg cac cat ttg 
Val Leu His His Leu 
455 

gta cgc gat gga. ccc 
Val Arg Asp Gly Pro 
475 



cac gaa gec gac ctt 1344 
His Glu Ala Asp Leu 
445 

ggc age gtg gec ggc 1392 

Gly Ser Val Ala Gly 

460 

gec atg taa 1434 
Ala Met 



<210> 14 
<211> 477 
<212> PRT 

<213> Phaeodactylum tricornutum 
<400> 14 

Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 
1 5 10 - 15 

Arg Lys lie Ser Trp Gin Glu Val Lys Thr His Ala Ser Pro Glu Asp 
20 . 25 30 

Ala Trp lie lie His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His 
35 40 45 

Glu His Pro Gly Gly Ala Val He Phe Thr His Ala Gly Asp Asp Met 
50 55 60 

Thr Asp He Phe Ala Ala Phe His Ala Pro Gly Ser Gin Ser Leu Met 
65 70 75 80 

Lys Lys Phe Tyr He Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu 

85 90 95 

Pro Gin Gin lie Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys 
100 105 110 
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Leu He Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr 
115 120 125 

Lys Cys Leu Ser Asn Met Ala He Trp Ala Ala Ala Cys Ala Leu Val 
130 135 140 

Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu 
145 150 155 160 

Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe 
180 185 190 

Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp Lys Asn Lys 
195 200 205 

His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val 
210 215 220 

Ala Gin Asp Gly Asp Pro Asp lie Asp Thr Met Pro Leu Leu Ala Trp 
225 230 235 240 

Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala Asp Gly Lys 
245 250 255 

Asp Ser Gly Leu Val Lys Phe Met He Arg Asn Gin Ser Tyr Phe Tyr 
260 265 270 

Phe Pro He Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe 
275 280 285 

Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu 
290 295 300 

Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys Ala Gly He 
305 310 315 320 



Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg 
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325 330 ' 335 

Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser 
340 345 350 

Cys Gly Phe Leu Leu Ala lie Val Phe Gly Leu Gly His Asn Gly Met 
355 360 365 

Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gin Val 
370 375 380 

Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gin Ala Phe 
38'5 390 395 . 400 

Val Asp Trp Phe Cys Gly Gly Leu Gin Tyr Gin Val Asp His His Leu 
405 410 415 

Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val 
420 425 430 

Glu Ser Phe Cys Lys Glu Trp Gly Val Gin Tyr His Glu Ala Asp Leu 
435 440 445 

Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser.. Val Ala Gly 
450 455' 4 60 

Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
465 470 475 



<210> 15 
<211> 1563 
<212> DNA 

<213> Ceratodon purpureus 

<220> 

<221> CDS 

<222> (1)..(1563) 

<223> A6-desaturase 
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<400> 15 

atg gtg tec cag ggc ggc ggt etc teg cag ggt tec att gaa gaa aac 48 
Met Val Ser Gin Gly Gly Gly Leu Ser Gin Gly Ser lie Glu Glu Asn 
15 10 15 

att gac gtt gag cac ttg gca acg atg ccc etc gtc agt gac ttc eta 96 
lie Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu 
20 25 30 

aat gtc ctg gga acg act ttg ggc cag tgg agt ctt tec act aca ttc 144 
Asn Val Leu Gly Thr Thr Leu Gly Gin Trp Ser Leu Ser Thr Thr Phe 
35 40 45 

get ttc aag agg etc acg act aag aaa cac agt teg gac ate teg gtg 192 
Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp lie Ser Val 
50 55 60 

gag gca caa aaa gaa teg gtt gcg egg ggg cca gtt gag aat att tct 240 
Glu Ala Gin Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn lie Ser 
65 70 75 80 

caa teg gtt gcg cag ccc ate agg egg agg tgg gtg cag gat aaa aag 288 
Gin Ser Val Ala Gin Pro lie Arg Arg Arg Trp Val Gin Asp Lys Lys 

85 .90 .95 

ccg gtt act tac age ctg aag gat gta get teg cac gat atg ccc cag 3 36 
Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gin 
100 105 110 

gac tgc tgg att ata ate aaa gag aag gtg tat gat gtg age acc ttc 3 84 
Asp Cys Trp lie lie lie Lys Glu Lys Val Tyr Asp Val Ser Thr Phe 
115 120 125 

get gag cag cac cct gga ggc acg gtt ate aac acc tac ttc gga cga 432 
Ala Glu Gin His Pro Gly Gly Thr Val He Asn Thr Tyr Phe Gly Arg 
130 135 140 

gac gee aca gat gtt ttc tct act ttc cac gca tec acc tea tgg aag 480 
Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys 
145 150 155 160 
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att ctt cag aat ttc -tac ate ggg aac ctt gtt agg gag gag ccg act 528 

lie Leu Gin Asn Phe Tyr lie Gly Asn Leu Val Arg Glu Glu Pro Thr 

165 170 175 

ttg gag ctg ctg aag gag tac aga gag ttg aga gcc ctt ttc ttg aga 576 

Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg 
180 185 190 

gaa cag ctt ttc aag agt tec aaa tec tac tac ctt ttc aag act etc 624 

Glu Gin Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu 
195 200 205 

ata aat gtt tec att gtt gcc aca age att gcg ata ate agt ctg tac 672 

lie Asn Val Ser He Val Ala Thr Ser He Ala lie lie Ser Leu Tyr 
210 215 220 

aag tct tac egg. gcg gtt ctg tta tea gcc agt ttg atg ggc ttg ttt 7 20 

Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe 

225 230 235 240 

att caa cag tgc gga tgg ttg tct cac gat ttt eta cac cat cag gta 768 

He Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His His Gin Val 

245 250 255 

ttt gag aca cgc tgg etc aat gac gtt gtt ggc tat gtg gtc ggc aac 816 

Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn 
260 265 270 

gtt gtt ctg gga ttc agt gtc teg tgg tgg aag ace aag cac aac ctg 8 64 

Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu 
275 280 285 

cat cat get get ccg aat gaa tgc gac caa aag tac aca ccg att gat 912 

His His Ala Ala Pro Asn Glu Cys Asp Gin Lys Tyr Thr Pro lie Asp 
290 295 300 

gag gat att gat act etc ccc ate att get tgg agt aaa gat etc ttg 9 60 

Glu Asp He Asp Thr Leu Pro He He Ala Trp Ser Lys Asp Leu Leu 

305 310 315 320 
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gcc act gtt gag age aag acc atg ttg cga gtt ctt cag tac cag cac 10 08 

Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gin Tyr Gin His 
325 330 335 

eta ttc ttt ttg gtt ctt ttg acg ttt gcc egg gcg agt tgg eta ttt 105 6 

Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe 

340 345 350 

tgg age gcg gcc ttc act etc agg ccc gag ttg acc ctt ggc gag aag 1104 

Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys 
355 360 365 

ctt ttg gag agg gga acg atg get ttg cac tac att tgg ttt aat agt 1152 

Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr He Trp Phe Asn Ser 
370 375 380 

gtt gcg ttt tat ctg etc ccc gga tgg aaa cca gtt gta tgg atg gtg 12 00 

Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val 
385 390 395 400 

gtc age gag etc atg tct ggt ttc ctg ctg gga tac gta ttt gta etc 1248 

Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu 
405 .410 415 

agt cac aat gga atg gag gtg tac aat acg tea aag gac ttc gtg aat 1296 

Ser His Asn Gly Met Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn 

420 425 430 

gcc cag att gca teg act cgc gac ate aaa gca ggg gtg ttt aat gat 1344 

Ala Gin He Ala Ser Thr Arg Asp He Lys Ala Gly Val Phe Asn Asp 
435 440 445 

tgg ttc acc gga ggt etc aac aga cag att gag cat cat eta ttt cca 1392 

Trp Phe Thr Gly Gly Leu Asn Arg Gin He Glu His His Leu Phe Pro 
450 455 460 

acg atg ccc agg cac aac ctt aat aaa att tct cct cac gtg gag act 1440 

Thr Met Pro Arg His Asn Leu Asn Lys lie Ser Pro His Val Glu Thr 
465 470 475 480 
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ttg tgc aag aag cat. gga ctg gtc tac gaa gac g-tg age atg get teg 14 88 

Leu Cys Lys Lys His Gly Leu Val Tyr. Glu Asp Val Ser Met Ala Ser 
485 490 495 

ggc act tac egg gtt ttg aaa aca ctt aag gac gtt gec gat get get, 1536 
Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala 
500 505 510 

tea cac cag cag ctt get gcg agt tga 1563 
Ser His Gin Gin Leu Ala Ala Ser 
515 520 



<210> 16 
<211> 520 
<212> PRT 

<213> Ceratodon purpureus 
<400> 16 

Met Val Ser Gin Gly Gly Gly Leu Ser Gin Gly Ser lie Glu Glu Asn 
1 5 V 10 15 

lie Asp Val Glu His Leu Ala Thr Met Pro Leu Val Ser Asp Phe Leu 
20 25 30 

Asn Val Leu Gly Thr Thr Leu Gly Gin Trp Ser Leu Ser Thr Thr Phe 
35 40 45 

Ala Phe Lys Arg Leu Thr Thr Lys Lys His Ser Ser Asp lie Ser Val 
50 55 60 

Glu Ala Gin Lys Glu Ser Val Ala Arg Gly Pro Val Glu Asn lie Ser 
65 70 .75 80 

Gin Ser Val Ala Gin Pro lie Arg Arg Arg Trp Val Gin Asp Lys Lys 

85 90 95 



Pro Val Thr Tyr Ser Leu Lys Asp Val Ala Ser His Asp Met Pro Gin 
100 105 110 
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Asp cys Trp He He He Lys Glu Lys Val Tyr Asp Val Ser Thr Phe 
U5 120 125 

Ala Glu Gin His Pro Gly Gly Thr Val He Asn Thr Tyr Phe Gly Arg 
130 135 140 

Asp Ala Thr Asp Val Phe Ser Thr Phe His Ala Ser Thr Ser Trp Lys 
145 150 155 160 

He Leu Gin Asn Phe Tyr He Gly Asn Leu val Arg Glu Glu Pro Thr 
165 170 175 

Leu Glu Leu Leu Lys Glu Tyr Arg Glu Leu Arg Ala Leu Phe Leu Arg 
180 185 190 

Glu Gin Leu Phe Lys Ser Ser Lys Ser Tyr Tyr Leu Phe Lys Thr Leu 
195 200 205 

lie Asn val Ser He Val Ala Thr Ser He Ala He He Ser Leu Tyr 
210 215 220 

Lys Ser Tyr Arg Ala Val Leu Leu Ser Ala Ser Leu Met Gly Leu Phe 
225 230 - 235 240 

He Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His His Gin Val 
245 250 255 

Phe Glu Thr Arg Trp Leu Asn Asp Val Val Gly Tyr Val Val Gly Asn 
260 265 270 

Val Val Leu Gly Phe Ser Val Ser Trp Trp Lys Thr Lys His Asn Leu 
275 280 285 

His His Ala Ala Pro Asn Glu Cys Asp Gin Lys Tyr Thr Pro He Asp 
290 295 300 

Glu Asp He Asp Thr Leu Pro He He Ala Trp Ser Lys Asp Leu Leu 
305 310 315 320 
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Ala Thr Val Glu Ser Lys Thr Met Leu Arg Val Leu Gin Tyr Gin His 
325 330 335 

Leu Phe Phe Leu Val Leu Leu Thr Phe Ala Arg Ala Ser Trp Leu Phe 
340 345 350 

Trp Ser Ala Ala Phe Thr Leu Arg Pro Glu Leu Thr Leu Gly Glu Lys 
355 360 365 

Leu Leu Glu Arg Gly Thr Met Ala Leu His Tyr lie Trp Phe Asn Ser 
370 375 380 

Val Ala Phe Tyr Leu Leu Pro Gly Trp Lys Pro Val Val Trp Met Val 
385 390 395 400 

Val Ser Glu Leu Met Ser Gly Phe Leu Leu Gly Tyr Val Phe Val Leu 
405 410 . 415 

Ser His Asn Gly Met. Glu Val Tyr Asn Thr Ser Lys Asp Phe Val Asn 
420 425 430 

Ala Gin He Ala Ser Thr Arg Asp lie Lys Ala Gly Val Phe Asn Asp 
435 440 445 

Trp Phe Thr Gly Gly Leu Asn Arg Gin He Glu His His Leu Phe Pro 
450 455 460 

Thr Met Pro Arg His Asn Leu Asn Lys He Ser Pro His Val Glu Thr 
465 470 475 480 

Leu Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser Met Ala Ser 
485 490 495 

Gly Thr Tyr Arg Val Leu Lys Thr Leu Lys Asp Val Ala Asp Ala Ala 
500 505 510 



Ser His Gin Gin Leu Ala Ala Ser 
515 520 
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<210> 17 
<211> 1578 
<212> DNA 

<213> Physcomitrella patens 

<220> 

<221> CDS 

<222> (1)..(1578) 

<223> A6-desaturase 

<400> 17 

atg gta ttc gcg ggc ggt gga ctt cag cag ggc tct etc gaa gaa aac 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
15 10 15 

ate gac gtc gag cac att gec agt atg tct etc ttc age gac ttc ttc 
He Asp Val Glu His He Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

agt tat gtg tct tea act gtt ggt teg tgg age gta cac agt ata caa 
Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser He Gin 
35 40 45 

cct ttg aag cgc ctg acg agt aag aag cgt gtt teg gaa age get gee 
Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

gtg caa tgt ata tea get gaa gtt cag aga aat teg agt acc cag gga 
Val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

act gcg gag gca etc gca gaa tea gtc gtg aag ccc acg aga cga agg 
Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 

85 90 95 

tea tct cag tgg aag aag teg aca cac ccc eta tea gaa gta gca gta 
Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 



48 



96 



144 



192 



240 



288 



336 



cac aac aag cca age gat tgc tgg att gtt gta aaa aac aag gtg tat 



384 
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His Asn Lys Pro Ser Asp Cys Trp lie Val Val Lys Ash Lys Val Tyr 
115 120 125 

gat gtt tec aat ttt gcg gac gag cat ccc gga gga tea gtt att agt 432 
Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val lie Ser 
130 135 140 

act tat ttt gga cga gac ggc aca gat gtt ttc tct agt ttt cat gca 480 
Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 

get tct aca tgg aaa att ctt caa gac ttt tac att ggt gac gtg gag 52 8 
Ala Ser Thr Trp Lys lie Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 



agg gtg gag ccg act cca gag ctg ctg aaa gat ttc cga gaa atg aga 576 
Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

get ctt ttc ctg agg gag caa ctt ttc aaa agt teg aaa ttg tac tat 62 4 
Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 - 205 

gtt atg aag ctg etc acg aat gtt get att ttt get gcg age att gca 672 
Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 

ata ata tgt tgg age aag act att tea gcg gtt ttg get tea get tgt 7 20 
He He Cys Trp Ser Lys Thr lie Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 



atg atg get ctg tgt ttc caa cag tgc gga tgg eta tec cat gat ttt 
Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 



768 



etc cac aat cag gtg ttt gag aca cgc tgg ctt aat gaa gtt gtc ggg 816 
Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val val Gly 
260 265 270 



tat gtg ate ggc aac gee gtt ctg ggg ttt agt aca ggg tgg tgg aag 864 
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Tyr Val lie Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

gag aag cat aac ctt cat cat get get cca aat gaa tgc gat cag act 912 
Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

tac caa cca att gat gaa gat att gat act etc ccc etc att gec tgg 960 
Tyr Gin Pro lie Asp Glu Asp lie Asp Thr Leu Pro Leu lie Ala Trp 
305 310 315 320 

age aag gac ata ctg gee aca gtt gag aat aag aca ttc ttg cga ate 1008 
Ser Lys Asp He Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg He 
325 330 335 

etc caa tac cag cat ctg ttc ttc atg ggt ctg tta ttt ttc gec cgt 1056 
Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

ggt agt tgg etc ttt tgg age tgg aga tat acc tct aca gca gtg etc 1104 
Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 

tea cct gtc gac agg ttg ttg gag aag gga act gtt ctg ttt cac tac 1152 
Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

ttt tgg ttc gtc ggg aca gcg tgc tat ctt etc cct ggt tgg aag cca 120 0 
Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

tta gta tgg atg gcg gtg act gag etc atg tec ggc atg ctg ctg ggc 124 8 
Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

ttt gta ttt gta ctt age cac aat ggg atg gag gtt tat aat teg tct 1296 
Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 



aaa gaa ttc gtg agt gca cag ate gta tec aca egg gat ate aaa gga 



1344 
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Lys Glu Phe Val Ser Ala Gin lie Val 
435 440 

aac ata ttc aac gac tgg ttc act ggt 
Asn lie Phe Asn Asp Trp Phe Thr Gly 
450 455 

cat cat ctt ttc cca aca atg ccc agg 
His His Leu Phe Pro Thr Met Pro Arg 
465 470 

cct aga gtg gag gtg ttc tgt aag aaa 
Pro Arg Val Glu Val Phe Cys Lys Lys 
485 



49 

Ser Thr Arg Asp lie Lys Gly 
445 

ggc ctt aac agg caa ata gag 1392 
Gly Leu Asn Arg Gin lie Glu 
460 

cat aat tta aac aaa ata gca 1440 
His Asn Leu Asn Lys lie Ala 
475 480 

cac ggt ctg gtg tac gaa gac 1488 
His Gly Leu Val Tyr Glu Asp 
490 495 



gta tct att get acc ggc act tgc aag gtt ttg aaa gca ttg aag gaa 1536 
Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 

gtc gcg gag get gcg gca gag cag cat get acc acc agt taa ' 1578 

Val Ala Glu Ala Ala. Ala .Glu Gin His Ala Thr Thr Ser 
515 520 525 



<210> 18 
<211> 525 
<212> PRT 

<213> Physcomitrella patens 



<400> 18 

Met Val Phe Ala Gly Gly Gly Leu 
1 5 

lie Asp Val Glu His lie Ala Ser 
20 

Ser Tyr Val Ser Ser Thr Val Gly 
35 40 

Pro Leu Lys Arg Leu Thr Ser Lys 



Gin Gin Gly Ser Leu Glu Glu Asn 
10 15 

Met Ser Leu Phe Ser Asp Phe Phe 
25 30 

Ser Trp Ser Val His Ser He Gin 

45 

Lys Arg Val Ser Glu Ser Ala Ala 
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50 55 60 

val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 

85 90 *5 

Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

His Asn Lys Pro Ser Asp Cys Trp He Val Val Lys Asn Lys Val Tyr 
115 120 125 

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 

Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 

Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 

He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 

Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 
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Tyr Val lie Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

Tyr Gin Pro He Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 

Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg lie 
325 330 335 

Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 



Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe' His Tyr 
370 - _ 375 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 

Lys Glu Phe Val Ser Ala Gin He Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 

Asn He Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin He Glu 
450 455 460 



His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys He Ala 
465 470 475 480 
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Pro Arg Val Glu Val Phe Cys Lys 
485 

Val Ser He Ala Thr Gly Thr Cys 
500 

Val Ala Glu Ala Ala Ala Glu Gin 
515 520 



52 

Lys His Gly Leu val Tyr Glu Asp 
490 495 

Lys Val Leu Lys Ala Leu Lys Glu 
505 510 

His Ala Thr Thr Ser 
525 



<210> 19 
<211> 837 
<212> DNA 

<213> Phytophthora 



infestans 



<220> 

<221> CDS 

<222> (1)..<837) 

<223> A6-elongase 



<400> 19 

atg teg act gag eta ctg cag age 

Met Ser Thr Glu Leu Leu Gin Ser 
1 5 

gag gee aag ctg ctg gac tgg gtc 
Glu Ala Lys Leu Leu Asp Trp Val 
20 



tac tac gcg tgg gec aac gec acg 4 8 
Tyr Tyr Ala Trp Ala Asn Ala Thr 
10 15 

gac cct gag ggc ggc tgg aag gtg 9 6 
Asp Pro Glu Gly Gly Trp Lys Val 
25 30 



cat cct atg gca gac tac ccc eta gee aac ttc tec age gtc tac gec 
His Pro Met Ala Asp Tyr Pro Leu Ala Asn Phe Ser Ser Val Tyr Ala 
35 40 45 

ate tgc gtc gga tac ttg etc ttc gta ate ttc ggc acg gec ctg atg 
lie Cys val Gly Tyr Leu Leu Phe Val lie Phe Gly Thr Ala Leu Met 
50 55 60 

aaa atg gga gtc ccc gee ate aag acc agt cca tta cag ttt gtg tac 
Lys Met Gly Val Pro Ala lie Lys Thr Ser Pro Leu Gin Phe Val Tyr 
65 70 75 80 



144 



192 



240 
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aac ccc ate 
Asn Pro He 

ate cag gec 
He Gin Ala 



aag tec gac 
Lys Ser Asp 
115 

tec aag atg 
Ser Lys Met 
130 

aag tgg aaa 
Lys Trp Lys 

145 ",. 

ctt ttc gtc 
Leu Phe Val 

tat get ace 
Tyr Ala Thr 

tac tac ttc 
Tyr Tyr Phe 
195 

etc acg cgc 
Leu Thr Arg 
210 

tac ctg ace 
Tyr Leu Thr 
225 



caa gtc att 
Gin Val He 
85 

tac cgc aac 
Tyr Arg Asn 
100 

gac ccc gtc 
Asp Pro Val 

etc gac ctg 
Leu Asp Leu 

cag ctt tec 
Gin Leu Ser 
150 

tac tat gtg 
Tyr Tyr Val 
165 

ate gtg etc 
He Val Leu 
180 

gtc age gec 
val ser Ala 



att cag ctt 
He Gin Leu 



tac tct cga 
Tyr Ser Arg 
230 



gec tgc tct 
Ala Cys Ser 

ggc tac ace 
Gly Tyr Thr 

. 105 

atg ggc aac 
Met Gly Asn 
120 

tgc gac aca 
Cys Asp Thr 
135 

ate ttg cac 
He Leu His 

acg ttc cgc 
Thr Phe Arg 

aac ggc ttc 
Asn Gly Phe 
185 

cac acg cgc 
His Thr Arg 
200 

ate cag ttc 
He Gin Phe 
215 

cag tgc cca 
Gin Cys Pro 



53 

tat atg tgc 
Tyr Met Cys 
90 

gee gee ccg 
Ala Ala Pro 



gtt ctg tac 
Val Leu Tyr 



gtc ttc att 
Val Phe lie 
140 

gtg tac cac 
Val Tyr His 

155 

gee get cag 
Ala Ala Gin 
170 

gtg cac ace 
Val His Thr 



aac att tgg 
Asn lie Trp 



gtg acc atg 
Val Thr Met 
220 

ggc atg cct 
Gly Met Pro 
235 



gtg gag gec 
Val Glu Ala 

tgc aac gec 
Cys Asn Ala 
110 

etc ttc tat 
Leu Phe Tyr 
125 

ate eta gga 
He Leu Gly 

cac ctt acc 
His Leu Thr 



gac ggg gac 
Asp Gly Asp 
175 

ate atg tac 
He Met Tyr 
190 

tgg aag aag 
Trp Lys Lys 
205 

aac gtg cag 
Asn Val Gin 



cct aag gtg 
Pro Lys Val 



gec 2 88 

Ala . 



ttt 336 
Phe 

etc 384 
Leu 

aag 432 
Lys 



gtg 480 
Val 

160; - - 

tea 528 
Ser 



act 576 
Thr 



tac 624 
Tyr 

ggc 672 
Gly 



ccg 720 

Pro 

240 
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etc atg tac ctt gtg tac gtg cag tea etc ttc tgg etc ttc atg aat 768 
Leu Met Tyr Leu Val Tyr Val Gin Ser Leu Phe Trp Leu Phe Met Asn 
245 250 . 255 

ttc tac att cgc gcg tac gtg ttc ggc ccc aag aaa ccg gec gtg gag 816 
Phe Tyr lie Arg Ala Tyr Val Phe Gly Pro Lys Lys Pro Ala Val Glu 
260 265 270 

gaa teg aag aag aag ttg taa 837 
Glu Ser Lys Lys Lys Leu 
275 



<210> 20 
<211> 278 
<212> PRT 

<213> Phytophthora infestans 
<400> 20 

Met Ser Thr Glu Leu Leu Gin Ser Tyr Tyr Ala Trp Ala Asn Ala Thr 
15 10 15 

Glu Ala Lys Leu Leu Asp Trp Val Asp Pro Glu Gly Gly Trp Lys Val 
20 25 30 

His Pro Met Ala Asp Tyr Pro Leu Ala Asn Phe Ser Ser Val Tyr Ala 
35 40 45 

lie Cys Val Gly Tyr Leu Leu Phe Val He Phe Gly Thr Ala Leu Met 
50 55 60 

Lys Met Gly Val Pro Ala He Lys Thr Ser Pro Leu Gin Phe Val Tyr 
65 70 75 80 

Asn Pro He Gin Val lie Ala Cys Ser Tyr Met Cys Val Glu Ala Ala 

85 . 90 95 

He Gin Ala Tyr Arg Asn Gly Tyr Thr Ala Ala Pro Cys Asn Ala Phe 
100 105 110 
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Lys Ser Asp Asp Pro Val Met Gly Asn Val Leu Tyr Leu Phe Tyr Leu 
115 120 125 

Ser Lys Met Leu Asp Leu Cys Asp Thr Val Phe lie lie Leu Gly Lys 
130 135 140 

Lys Trp Lys Gin Leu Ser lie Leu His Val Tyr His His Leu Thr Val 
145 150 155 160 

Leu Phe Val Tyr Tyr Val Thr Phe Arg Ala Ala Gin Asp Gly Asp Ser 
165 170 175 

Tyr Ala Thr lie Val Leu Asn Gly Phe Val His Thr lie Met Tyr Thr 
180 185 190 

Tyr Tyr Phe Val Ser Ala His Thr Arg Asn lie Trp Trp Lys Lys Tyr 
195 200 205 

Leu Thr Arg lie Gin Leu -lie Gin Phe Val Thr Met Asn Val Gin Gly- 
210 215 220 

Tyr Leu Thr Tyr Ser Arg Gin Cys Pro Gly Met Pro Pro Lys Val Pro 
225 230 235 240 

Leu Met Tyr Leu Val Tyr Val Gin Ser Leu Phe Trp Leu Phe Met Asn 
245 250 255 

Phe Tyr lie Arg Ala Tyr Val Phe Gly Pro Lys Lys Pro Ala val Glu 
260 265 270 

Glu Ser Lys Lys Lys Leu 
275 



<210> 21 
<211> 1410 
<212> DNA 

<213> Phaeodactylum tricornutum 
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<220> 

<221> CDS 

<222> (1)..{1410) 

<223> A5-desaturase 



US 120 



gga tgg ttc ttc cgt gcg ttt tgc tac att gcc att ttc ttc tac 
Gly Trp Phe Phe Arg Ala Phe Cys Tyr He Ala He Phe Phe Tyr Leu 



144 



192 



<400> 21 

atg get ccg gat gcg gat aag ctt cga caa =gc cag acg act gcg gta 48 
Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr Ala Val 

15 

gcg aag cac aat get get acc ata teg acg cag gaa cgc ctt tgc agt 96 
Ala Lys His Asn Ala Ala Thr He Ser Thr Gin Glu Arg Leu Cys Ser 

ctg tct teg etc aaa ggc gaa gaa gtc tgc ate gac gga ate ate tat 
Leu Ser Ser Leu Lys Gly Glu Glu Val Cys He Asp Gly lie He Tyr 
35 40 45 • 

gac etc caa tea ttc gat cat ccc ggg ggt gaa acg atc aaa ^ ^ 
Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr He Lys Met Phe 
50 55 60 

ggt ggc aae gat gtc act gta cag tac aag atg att cac ccg tae cat 
Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met He His Pro Tyr His 

acc gag aag cat ttg gaa aag atg aag cgt gtc ggc aag gtg acg gat 288 
Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 
85 90 95 

ttc g tc tgc gag tac aag ttc gat acc gaa ttt gaa cgc gaa atc aaa 336 
Phe val cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu He Lys 

105 

cga g aa gtc ttc aag att gtg cgfl cgfl ggc ^ ^ ^ ^ ^ 
Arg Glu val Phe Lys He Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 

125 



240 



432 
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130 135 140 ' 

cag tac cat tgg gtc acc acg gga acc tct tgg ctg ctg gcc gtg gcc 480 
Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
145 150 155 160 

tac gga ate tec caa gcg atg att ggc atg aat gtc cag cac gat gcc 528 
Tyr Gly He Ser Gin Ala Met He Gly Met Asn Val Gin His Asp Ala 
165 170 175 



aac cac ggg gcc acc tec aag cgt ccc tgg gtc aac gac atg eta ggc 576 
Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 

etc ggt gcg gat ttt att ggt ggt tec aag tgg etc tgg cag gaa caa 62 4 
Leu Gly Ala Asp Phe lie Gly Gly Ser Lys Trp Leu Trp Gin Glu Gin 
195 200 205 

cac tgg acc cac cac get tac acc aat cac gcc gag atg gat ccc gat 67 2 
His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp 
210 215 220 

age ttt ggt gcc gaa cca atg etc eta ttc aac gac tat ccc ttg gat 72 0 
Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 
225 230 235 240 



cat ccc get cgt acc tgg eta cat cgc ttt caa gca ttc ttt tac atg 76 8 
His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 

ccc gtc ttg get gga tac tgg ttg tec get gtc ttc aat cca caa att 816 
Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin He 
260 265 270 

ctt gac etc cag caa cgc ggc gca ctt tec gtc ggt ate cgt etc gac 864 
Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly He Arg Leu Asp 
275 280 285 

aac get ttc att cac teg cga cgc aag tat gcg gtt ttc tgg egg get 912 
Asn Ala Phe He His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
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290 295 300 

gtg tac att gcg gtg aac gtg att get ccg ttt tac aca aac tec ggc 960 
Val Tyr He Ala Val Asn Val He Ala Pro Phe Tyr Thr Asn Ser Gly 
305 310 315 320 

etc gaa tgg tec tgg cgt gtc ttt gga aac ate atg etc atg ggt gtg 1008 
Leu Glu Trp Ser Trp Arg Val Phe Gly Asn He Met Leu Met Gly val 
325 330 335 

gcg gaa teg etc gcg ctg gcg gtc ctg ttt teg ttg teg cac aat ttc 1056 
Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe 
340 345 350 

gaa tec gcg gat cgc gat ccg acc gec cca ctg aaa aag acg gga gaa 1104 
Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu 
355 360 365 

cca gtc gac tgg ttc aag aca cag gtc gaa act tec tgc act tac ggt 1152 
Pro Val Asp Trp Phe Lys Thr Gin Val Glu Thr Ser Cys Thr Tyr Gly 
370 375 380 

gga ttc ctt tec ggt tgc ttc acg gga ggt etc aac ttt cag gtt gaa 1200 
Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn phe Gin Val Glu 
385 390 395 400 

cac cac ttg ttc cca cgc atg age age get tgg tat ccc tac att gec 124 8 
His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr He Ala 
405 410 415 

ccc aag gtc cgc gaa att tgc gec aaa cac ggc gtc cac tac gec tac 129 6 
Pro Lys Val Arg Glu He Cys Ala Lys His Gly Val His Tyr Ala Tyr 
420 425 430 

tac ccg tgg ate cac caa aac ttt etc tec acc gtc cgc tac atg cac 1344 
Tyr Pro Trp He His Gin Asn Phe Leu Ser Thr Val Arg Tyr Met His 
435 440 445 



gcg gee ggg acc ggt gec aac tgg cgc cag atg gec aga gaa aat ccc 
Ala Ala Gly Thr Gly Ala Asn Trp Arg Gin Met Ala Arg Glu Asn Pro 



1392 
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450 455 460 

ttg acc gga egg gcg taa 
Leu Thr Gly Arg Ala 
465 470 



<210> 22 
<211> 469 
<212> PRT 

<213> Phaeodactylum tricornutum 
<4D0> 22 

Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr Ala Val 
15 10 15 

Ala Lys His Asn Ala Ala Thr lie Ser Thr Gin Glu Arg Leu Cys Ser 
20 25 . 30 

Leu Ser Ser Leu Lys Gly Glu Glu Val Cys lie Asp Gly lie lie Tyr 
35 40 . 45 

Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr lie Lys Met Phe 
50 55 60 

Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met lie His Pro Tyr His 
65 70 75 80 

Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 

85 90 95 

Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu lie Lys 
100 105 110 

Arg Glu Val Phe Lys lie Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 
115 120 125 



Gly Trp Phe Phe Arg Ala Phe Cys Tyr lie Ala lie Phe Phe Tyr Leu 
130 135 140 
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Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
l « "0 155 160 

Tyr Gly He Ser Gin Ala Met He Gly Met Asn Val Gin His Asp Ala 
165 170 175 

Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 

Leu Gly Ala Asp Phe lie Gly Gly Ser Lys Trp Leu Trp Gin Glu Gin 
195 200 205 

His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp 
210 215 220 

Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 
2 " 230 235 240 

His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 

Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gla lie 
260 265 270 

Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly He Arg Leu Asp 
275 280 285 

Asn Ala Phe He His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
290 295 300 

Val Tyr He Ala Val Asn Val He Ala Pro Phe Tyr Thr Asn Ser Gly 
3 °5 310 315 320 

Leu Glu Trp Ser Trp Arg Val Phe Gly Asn He Met Leu Met Gly val 
325 330 335 

Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe 
340 345 350 



Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu 
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Pro Val Asp Trp Phe 
370 

Gly Phe Leu Ser Gly 
385 

His His Leu Phe Pro 
405 

Pro Lys Val Arg Glu 
420 

Tyr Pro Trp lie His 
435 

Ala Ala Gly Thr Gly 
450 

Leu Thr Gly Arg Ala 
465 



61 

360 

Lys Thr Gin Val Glu Thr 
375 

Cys Phe Thr Gly Gly Leu 
390 395 

Arg Met Ser Ser Ala Trp 
410 

lie Cys Ala Lys His Gly 
425 

Gin Asn Phe Leu Ser Thr 
4 40 

Ala Asn Trp Arg Gin Met 
455 



365 - 

Ser Cys Thr Tyr Gly 
380 

Asn Phe Gin Val Glu 
400 

Tyr Pro Tyr lie Ala 
415 

Val His Tyr Ala Tyr 
430 

Val Arg Tyr Met His 
445 

Ala Arg Glu Asn Pro 
460 



<210> 23 
<211> 1344 
<212> DNA 

<213> Caenorhabditis elegans 

<220> 

<221> CDS 

<222> (1)..(1344) 

<223> A5-desaturase 

<400> 23 

atg gta tta cga gag caa gag cat gag cca ttc ttc att aaa att gat 4 8 
Met Val Leu Arg Glu Gin Glu His Glu Pro Phe Phe lie Lys lie Asp 
1 5 10 15 



gga aaa tgg tgt caa att gac gat get gtc ctg aga tea cat cca ggt 9 6 
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Gly Lys Trp Cys Gin lie Asp Asp Ala Val Leu Arg Ser His Pro Gly 
20 25 30 

ggt agt gca att act acc tat aaa aat atg gat gcc act acc gta ttc 14 4 
Gly ser Ala lie Thr Thr Tyr Lys Asn Met Asp Ala Thr Thr Val Phe 
35 40 45 

cac aca ttc cat act ggt tct aaa gaa gcg tat caa tgg ctg aca gaa 192 
His Thr Phe His Thr Gly Ser Lys Glu Ala Tyr Gin Trp Leu Thr Glu 
50 55 60 

ttg aaa aaa gag tgc cct aca caa gaa cca gag ate cca gat att aag 240 
Leu Lys Lys Glu Cys Pro Thr Gin Glu Pro Glu lie Pro Asp He Lys 
65 70 75 . 80 

gat gac cca ate aaa gga att gat gat gtg aac atg gga act ttc aat 2 88 
Asp Asp Pro He Lys Gly He Asp Asp Val Asn Met Gly Thr Phe Asn 

35 90 95 

att tct gag aaa cga tct gcc caa ata aat aaa agt ttc act gat eta 336 
He Ser Glu Lys Arg Ser Ala Gin He Asn Lys Ser Phe Thr Asp Leu 
100 105 no 

cgt atg cga gtt cgt gca gaa gga ctt atg gat gga tct. cct ttg ttc 384 
Arg Met Arg Val Arg Ala Glu Gly Leu Met Asp Gly Ser Pro Leu Phe 
"5 120 125 

tac att aga aaa att ctt gaa aca ate ttc aca att ctt ttt gca ttc 4 32 
Tyr He Arg Lys He Leu Glu Thr He Phe Thr He Leu Phe Ala Phe 
13 0 135 140 

tac ctt caa tac cac aca tat tat ctt cca tea get att eta atg gga 480 
Tyr Leu Gin Tyr His Thr Tyr Tyr Leu Pro Ser Ala He Leu Met Gly 
145 150 155 leo 

gtt gcg tgg caa caa ttg gga tgg tta ate cat gaa ttc gca cat cat 528 
val Ala Trp Gin Gin Leu Gly Trp Leu He His Glu Phe Ala His His 
165 170 175 

cag ttg ttc aaa aac aga tac tac aat gat ttg gcc age tat ttc gtt 576 



0093/00064 



CA 02485060 2004-10-28 



63 

Gin Leu Phe Lys Asn Arg Tyr Tyr Asn Asp Leu Ala Ser Tyr Phe Val 
180 185 190 

gga aac ttt tta caa gga ttc tea tct ggt ggt tgg aaa gag cag cac 624 
Gly Asn Phe Leu Gin Gly Phe Ser Ser Gly Gly Trp Lys Glu Gin His 
195 200 205 

aat gtg cat cac gca gec aca aat gtt gtt gga cga gac gga gat ctt 6 72 
Asn Val His His Ala Ala Thr Asn Val val Gly Arg Asp Gly Asp Leu 
210 215 220 



gat tta gtc cca ttc tat get aca gtg gca gaa cat etc aac aat tat 720 

Asp Leu Val Pro Phe Tyr Ala Thr Val Ala Glu His Leu Asn Asn Tyr 

225 230 235 240 

tct cag gat tea tgg gtt atg act eta ttc aga tgg caa cat gtt cat 768 

Ser Gin Asp Ser Trp Val Met Thr Leu Phe Arg Trp Gin His Val His 

245 250 255 

tgg aca ttc atg tta cca ttc etc cgt etc teg tgg ctt ctt cag tea 816 

Trp Thr Phe Met Leu Pro Phe Leu Arg Leu Ser Trp Leu Leu Gin Ser 
260 265 270 

ate att ttt gtt agt cag atg cca act cat tat tat gac tat tac aga 8 64 

lie lie Phe Val Ser Gin Met Pro Thr His Tyr Tyr Asp Tyr Tyr Arg 
275 280 285 

aat act gcg att tat gaa cag gtt ggt etc tct ttg cac tgg get tgg 912 

Asn Thr Ala lie Tyr Glu Gin Val Gly Leu Ser Leu His Trp Ala Trp 
290 295 300 



tea ttg ggt caa ttg tat ttc eta ccc gat tgg tea act aga ata atg 960 
Ser Leu Gly Gin Leu Tyr Phe Leu Pro Asp Trp Ser Thr Arg lie Met 
305 310 315 320 

ttc ttc ctt gtt tct cat ctt gtt gga ggt ttc ctg etc tct cat gta 1008 
Phe Phe Leu Val Ser His Leu Val Gly Gly Phe Leu Leu Ser His Val 
325 330 335 



gtt act ttc aat cat tat tea gtg gag aag ttt gca ttg age teg aac 



1056 
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val Thr Phe Asn His Tyr Ser val Glu Ly S Phe Ala Leu Ser Ser Asn 
340 345 350 

ate atg tea aat tac get tgt ett caa ate atg acc aca aga aat atg n 04 
He Met Ser Asn Tyr Ala Cys Leu Gin He Met Thr Thr Arg Asn Met 
355 3 6 o 365 

aga ect gga aga tte att gac tgg ett tgg gga ggt ett aae tat eag U52 
Arg Pro Gly Arg Phe He Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin 
370 375 380 

att gag cae eat ett tte eea aeg atg eea ega cac aae ttg aae aet 1200 
He Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Thr 
385 390 395 400 

gtt atg cea ett gtt aag gag ttt gea gca gea aat ggt tta eea tac 1248 
Val Met Pro Leu Val Lys Glu Phe Ala Ala Ala Asn Gly Leu Pro Tyr 

415 



005 410 



atg gte gac gat tat tte aca gga tte tgg ett gaa att gag caa tte 1296 
Met val Asp Asp Tyr Phe Thr Gly Phe Trp Leu Glu lie Glu Gin Phe 
420 425 4 3o 

cga aat att gca aat gtt get get aaa ttg act aaa aag att gec tag 1344 
Arg Asn He Ala Asn Val Ala Ala Lys Leu Thr Lys Lys He Ala 
43 5 440 445 



<210> 24 
<211> 447 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 24 

Met Val Leu Arg Glu Gin Glu His 
1 5 

Gly Lys Trp Cys Gin He Asp Asp 
20 



Glu Pro Phe Phe He Lys He Asp 
10 15 

Ala val Leu Arg Ser His Pro Gly 
25 30 
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Gly Ser Ala lie Thr Thr Tyr Lys Asn Met Asp Ala Thr Thr Val Phe 
35 40 45 

His Thr Phe His Thr Gly Ser Lys Glu Ala Tyr Gin Trp Leu Thr Glu 
50 55 60 

Leu Lys Lys Glu Cys Pro Thr Gin Glu Pro Glu He Pro Asp lie Lys 
65 70 75 80 

Asp Asp Pro He Lys Gly He Asp Asp Val Asn Met Gly Thr Phe Asn 

85 90 95 

lie Ser Glu Lys Arg Ser Ala Gin He Asn Lys Ser Phe Thr Asp Leu 
100 105 110 

Arg Met Arg Val Arg Ala Glu Gly Leu Met Asp Gly Ser Pro Leu Phe 

1X5 120 - 125 ^ 

Tyr He Arg Lys He Leu Glu Thr He Phe Thr He Leu Phe Ala Phe 
130 .135 140. 

Tyr Leu Gin Tyr His Thr Tyr Tyr Leu Pro Ser Ala He Leu Met Gly 
145 150 155 160 

Val Ala Trp Gin Gin Leu Gly Trp Leu lie His Glu Phe Ala His His 
165 170 175 

Gin Leu Phe Lys Asn Arg Tyr Tyr Asn Asp Leu Ala Ser Tyr Phe Val 
180 185 190 

Gly Asn Phe Leu Gin Gly Phe Ser Ser Gly Gly Trp Lys Glu Gin His 
195 200 205 

Asn Val His His Ala Ala Thr Asn Val Val Gly Arg Asp Gly Asp Leu 
210 215 220 

Asp Leu Val Pro Phe Tyr Ala Thr Val Ala Glu His Leu Asn Asn Tyr 
225 230 235 240 

Ser Gin Asp Ser Trp Val Met Thr Leu Phe Arg Trp Gin His Val His 
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245 250 255 

Trp Thr Phe Met Leu Pro Phe Leu Arg Leu Ser Trp Leu Leu Gin Ser 
260 265 270 

lie He Phe Val Ser Gin Met Pro Thr His Tyr Tyr Asp Tyr Tyr Arg 
275 280 285 

Asn Thr Ala He Tyr Glu Gin Val Gly Leu Ser Leu His Trp Ala Trp 
290 295 300 

Ser Leu Gly Gin Leu Tyr Phe Leu Pro Asp Trp Ser Thr Arg He Met 
305 310 315 320 

Phe Phe Leu Val Ser His Leu Val Gly Gly Phe Leu Leu Ser His Val 
325 330 335 

Val Thr Phe Asn His Tyr Ser Val Glu Lys Phe Ala Leu Ser Ser Asn 
340 345 350 

He Met Ser Asn Tyr Ala Cys Leu Gin He Met Thr Thr Arg Asn Met 
355 360 365 

Arg Pro Gly Arg Phe lie Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gin 
370 375 380 

He Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Thr 
385 390 395 400 

Val Met Pro Leu Val Lys Glu Phe Ala Ala Ala Asn Gly Leu Pro Tyr 
405 410 415 

Met Val Asp Asp Tyr Phe Thr Gly Phe Trp Leu Glu He Glu Gin Phe 
420 425 430 

Arg Asn He Ala Asn Val Ala Ala Lys Leu Thr Lys Lys He Ala 
435 440 445 



<210> 25 
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<211> 954 
<212> DNA 

<213> Mortierella alpina 

<220> 

<221> CDS 

<222> ( 1) . . (954) 

<223> A6-elongase 

<400> 25 

atg gcc gcc gca ate ttg gac aag gtc aac ttc ggc att gat cag ccc 

Met Ala Ala Ala lie Leu Asp Lys Val Asn Phe Gly lie Asp Gin Pro 
1 .5 10 15 

ttc gga ate aag etc gac ace tac ttt get cag gcc tat gaa etc gtc 
Phe Gly lie Lys Leu Asp Thr Tyr Phe Ala Gin Ala Tyr Glu Leu Val 
20 25 30 

ace gga aag tec ate gac tec ttc gtc ttc cag gag ggc gtc acg cct 
Thr Gly Lys Ser lie Asp Ser Phe Val Phe Gin Glu Gly Val- Thr Pro 
35 40 45 



etc teg acc cag aga gag gtc gcc atg tgg act ate act tac ttc gtc 

Leu Ser Thr Gin Arg Glu Val Ala Met Trp Thr He Thr Tyr Phe Val 
50 55 60 

gtc ate ttt ggt ggt cgc cag ate atg aag age cag gac gcc ttc aag 

Val He Phe Gly Gly Arg Gin He Met Lys Ser Gin Asp Ala Phe Lys 

65 70 75 80 



etc aag ccc etc 
Leu Lys Pro Leu 

gga teg ctg ttg 
Gly Ser Leu Leu 
100 

aga aac gga ctt 
Arg Asn Gly Leu 



ttc ate etc cac 
Phe lie Leu His 
85 

etc ctg ttc ate 
Leu Leu Phe lie 



ttc tac gcc ate 
Phe Tyr Ala He 



aac ttc etc ctg 
Asn Phe Leu Leu 
90 

gag aac ctg gtc 
Glu Asn Leu Val 
105 

tgc gac gac ggt 
Cys Asp Asp Gly 



acg ate gcg tec 
Thr He Ala Ser 
95 

ccc ate etc gcc 
Pro He Leu Ala 
110 

gcc tgg acc cag 
Ala Trp Thr Gin 
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cgc etc gag etc 
Arg Leu Glu Leu 
130 

ttg gec gac acc 
Leu Ala Asp Thr 
145 

ctg cac tac ttc 
Leu His Tyr Phe 

ctt gga gga tac 
Leu Gly Gly Tyr 
180 

act gtc cac gtc 
Thr val His Val 
195 

gtt cgc ate tgg 
Val Arg lie Trp 
210 

ttc gtt ctt gac 
Phe Val Leu Asp 
225 

gec ttc acc tac 
Ala Phe Thr Tyr 



acc gag ggt get 
Thr Glu Gly Ala 
260 

ttg etc ttt ate 
Leu Leu Phe He 



etc tac tac 
Leu Tyr Tyr 
135 

gtc ttt ttg 
Val Phe Leu 
150 

cac cac teg 
His His Ser 
165 

act tea gtg 
Thr Ser Val 



ttc atg tac 
Phe Met Tyr 

tgg aag cag 
Trp Lys Gin 
215 

etc gga ttc 
Leu Gly Phe 
230 

ttc ccc tgg 
Phe Pro Trp 
245 

get etc ttt 
Ala Leu Phe 

aac ttc tac 
Asn Phe Tyr 



68 

120 

etc aac tac ctg 
Leu Asn Tyr Leu 

gtc etc aag aag 
Val Leu Lys Lys 
155 

atg acc atg gtt 
Met Thr Met Val 
170 

tec tgg gtc cct 
Ser Trp Val Pro 
185 

tac tac tac atg 
Tyr Tyr Tyr Met 
200 

tac ttg acc act 
Tyr Leu Thr Thr 

ate tac ttc tgc 
He Tyr Phe Cys 
235 

get ccc aac gtc 
Ala Pro Asn Val 
250 

ggc tgc gga etc 
Gly Cys Gly Leu 
265 

cgc att acc tac 
Arg He Thr Tyr 



125 J 

gtc aag tac 
val Lys Tyr 
140 

aag cct ctt 
Lys Pro Leu 

etc tgc ttt 
Leu Cys Phe 

att acc etc 
lie Thr Leu 
190 

cgc tec get 
Arg Ser Ala 
205 

etc cag ate 
Leu Gin He 
220 

gec tac acc 
Ala Tyr Thr 

ggc aag tgc 
Gly Lys Cys 

etc tec age 
Leu Ser Ser 
270 

aat gec aag 
Asn Ala Lys 



tgg gag 432 
Trp Glu 



gag ttc 480 
Glu Phe 
160 

gtc cag 528 
Val Gin 
175 

aac ttg 576 
Asn Leu 

gee ggt 624 
Ala Gly 



gtc cag 672 
Val Gin 

tac ttc 720 
Tyr Phe 
240 

gee ggt 7 68 

Ala Gly 

255 

tat etc 816 
Tyr Leu 

gee aag 864 
Ala Lys 
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275 280 285 ' 

gca gcc aag gag cgt gga age aac ttt acc ccc aag act gtc aag tec 912 
Ala Ala Lys Glu Arg Gly Ser Asn Phe Thr Pro Lys Thr Val Lys Ser 
290 295 300 

ggc gga teg ccc aag aag ccc tec aag age aag cac ate taa 954 
Gly Gly Ser Pro Lys Lys Pro Ser Lys Ser Lys His He 
305 310 315 



<210> 26 
<2ll> 317 
<212> PRT 

<213> Mortierella alpina 
<400> 26 

Met Ala Ala Ala lie Leu Asp Lys Val Asn Phe Gly lie Asp Gin Pro 
1 - 5 10 . 15 

Phe Gly lie Lys Leu Asp Thr Tyr Phe Ala Gin Ala Tyr Glu Leu Val 
20 25 30 

Thr Gly Lys Ser lie Asp Ser Phe Val Phe Gin Glu Gly Val Thr Pro 
35 40 45 

Leu Ser Thr Gin Arg Glu Val Ala Met Trp Thr lie Thr Tyr Phe Val 
50 55 60 

Val lie Phe Gly Gly Arg Gin lie Met Lys Ser Gin Asp Ala Phe Lys 
65 70 75 80 

Leu Lys Pro Leu Phe He Leu His Asn Phe Leu Leu Thr He Ala Ser 

85 90 . 95 

Gly Ser Leu Leu Leu Leu Phe He Glu Asn Leu Val Pro lie Leu Ala 
100 105 110 



Arg Asn Gly Leu Phe Tyr Ala He Cys Asp Asp Gly Ala Trp Thr Gin 
115 120 125 
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Arg Leu Glu Leu Leu Tyr Tyr Leu Asn Tyr Leu Val Lys Tyr Trp Glu 
130 135 140 

Leu Ala Asp Thr Val Phe Leu Val Leu Lys Lys Lys Pro Leu Glu Phe 
145 150 155 160 

Leu His Tyr Phe His His Ser Met Thr Met Val Leu Cys Phe Val Gin 
165 170 175 

Leu Gly Gly Tyr Thr Ser Val Ser Trp Val Pro lie Thr Leu Asn Leu 
180 185 190 

Thr Val His Val Phe Met Tyr Tyr Tyr Tyr Met Arg Ser Ala Ala Gly 
195 200 205 

Val Arg lie Trp Trp Lys Gin Tyr Leu Thr Thr Leu Gin lie Val Gin 
210 215 220 

Phe Val Leu Asp Leu Gly Phe lie Tyr Phe Cys Ala Tyr Thr Tyr Phe 
225 230 235 240 

Ala Phe Thr Tyr Phe Pro Trp Ala Pro Asn Val Gly Lys Cys Ala Gly 
245 . 250 ' 255 

Thr Glu Gly Ala Ala Leu Phe Gly Cys Gly Leu Leu Ser Ser Tyr Leu 
260 265 270 

Leu Leu Phe lie Asn Phe Tyr Arg lie Thr Tyr Asn Ala Lys Ala Lys 
275 280 285 

Ala Ala Lys Glu Arg Gly Ser Asn Phe Thr Pro Lys Thr Val Lys Ser 
290 295 300 

Gly Gly Ser Pro Lys Lys Pro Ser Lys Ser Lys His He 
305 310 315 



<210> 27 
<211> 1320 
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<212> DNA 

<213> Thraustochytrium 

<220> 

<221> CDS 

<222> (1)..(1320) 

<223> A5-desaturase 

<400> 27 

atg ggc aag ggc age gag ggc cgc age gcg gcg cgc gag atg acg gec 4 8 

Met Gly Lys Gly Ser Glu Gly Arg Ser Ala Ala Arg Glu Met Thr Ala 

1 5 10 15 

gag gcg aac ggc gac aag egg aaa acg att ctg ate gag ggc gtc ctg 9 6 
Glu Ala Asn Gly Asp Lys Arg Lys Thr lie Leu lie Glu Gly Val Leu 
20 25 30 

tac gac gcg acg aac ttt aag cac ccg ggc ggt teg ate ate aac ttc 144 
Tyr Asp Ala Thr Asn Phe Lys His Pro Gly Gly Ser He He Asn Phe 
35 40 45 

ttg acc gag ggc gag gec ggc gtg gac gcg acg cag gcg tac cgc gag 192 
Leu Thr Glu Gly Glu Ala Gly Val Asp Ala Thr Gin Ala Tyr Arg Glu 
50 55 _ 60 

ttt cat cag egg tec ggc aag gec gac aag tac etc aag teg ctg ccg 24 0 
Phe His Gin Arg Ser Gly Lys Ala Asp Lys Tyr Leu Lys Ser Leu Pro 
65 70 75 80 

aag ctg gat gcg tec aag gtg gag teg egg ttc teg gec aaa gag cag 2 88 
Lys Leu Asp Ala Ser Lys Val Glu Ser Arg Phe Ser Ala Lys Glu Gin 

85 90 95 

gcg egg cgc gac gee atg acg cgc gac tac gcg gec ttt cgc gag gag 3 36 
Ala Arg Arg Asp Ala Met Thr Arg Asp Tyr Ala Ala Phe Arg Glu Glu 
100 105 110 

etc gtc gee gag ggg tac ttt gac ccg teg ate ccg cac atg att tac 3 84 
Leu Val Ala Glu Gly Tyr Phe Asp Pro Ser lie Pro His Met lie Tyr 
115 120 125 



age aag cac cac gec gcg ccc aac cgc etc gag cac gat gtc gat etc 
Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val Asp Leu 
210 215 . 220 

aac acg ctg ccc ctg gtc gec ttt aac gag cgc gtc gtg cgc aag gtc 
Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val Arg Lys Val 
225 230 235 240 

aag ccg gga teg ctg ctg gcg etc tgg ctg cgc gtg cag gcg tac etc 
Lys Pro Gly Ser Leu Leu Ala Leu Trp Leu Arg Val Gin Ala Tyr Leu 
245 250 255 

ttt gcg ccc gtc teg tgc ctg etc ate ggc ctt ggc tgg acg etc tac 
Phe Ala Pro Val Ser Cys Leu Leu He Gly Leu Gly Trp Thr Leu Tyr 
260 265 270 

ctg cac ccg cgc tac atg ctg cgc acc aag egg cac atg gag ttc gtc 
Leu His Pro Arg Tyr Met Leu Arg Thr Lys Arg His Met Glu Phe Val 
275 280 285 



432 



480 
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cgc gtc gtg gag ate gtg gcg etc ttc gcg etc teg ttc tgg etc atg 
Arg Val Val Glu He Val Ala Leu Phe Ala Leu Ser Phe Trp Leu Met 
130 135 140 

tec aag gee teg ccc acc teg etc gtg ctg ggc gtg gtg atg aac ggc 
Ser Lys Ala Ser Pro Thr Ser Leu Val Leu Gly Val Val Met Asn Gly 
145 150 155 160 

att gcg cag ggc cgc tgc ggc tgg gtc atg cac gag atg ggc cac ggg 
He Ala Gin Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly 
165 170 175 

teg ttc acg ggc gtc ate tgg etc gac gac egg atg tgc gag ttc ttc 
Ser Phe Thr Gly Val He Trp Leu Asp Asp Arg Met Cys Glu Phe Phe 
180 185 190 

tac ggc gtc ggc tgc ggc atg age ggg cac tac tgg aag aac cag cac 624 
Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gin His 
195 - 200 205 



528 



576 



672 



720 



768 



816 



864 
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tgg ate ttc gcg cgc tac att ggc tgg ttc teg etc atg ggc get etc 
Trp lie Phe Ala Arg Tyr lie Gly Trp Phe Ser Leu Met Gly Ala Leu 
290 295 300 

ggc tac teg ccg ggc acc teg gtc ggg atg tac ctg tgc teg ttc ggc 
Gly Tyr Ser Pro Gly Thr Ser Val Gly Met Tyr Leu Cys Ser Phe Gly 
305 310 315 320 

etc ggc tgc att tac att ttc ctg cag ttc gee gtc age cac acg cac 
Leu Gly Cys lie Tyr lie Phe Leu Gin Phe Ala Val Ser His Thr His 
325 330 335 

ctg ccg gtg acc aac ccg gag gac cag ctg cac tgg etc gag tac gcg 
Leu Pro Val Thr Asn Pro Glu Asp Gin Leu His Trp Leu Glu Tyr Ala 
340 345 350 

gec gac cac acg gtg aac att age acc aag tec tgg etc gtc acg tgg 
Ala Asp His Thr Val Asn lie Ser Thr Lys Ser Trp Leu Val' Thr Trp 
355 360 365 

tgg atg teg aac ctg aac ttt cag ate gag cac cac etc ttc ccc acg 
Trp Met Ser Asn Leu Asn Phe Gin lie Glu His His Leu Phe Pro Thr 
370 375 '. 380 

gcg ccg cag ttc cgc ttc aag gaa ate agt cct cgc gtc gag gee etc 
Ala Pro Gin Phe Arg Phe Lys Glu lie Ser Pro Arg Val Glu Ala Leu 
385 390 395 400 

ttc aag cgc cac aac etc ccg tac tac gac ctg ccc tac acg age gcg 
Phe Lys Arg His Asn Leu Pro Tyr Tyr Asp Leu Pro Tyr Thr Ser Ala 
405 410 415 

gtc teg acc acc ttt gee aat ctt tat tec gtc ggc cac teg gtc ggc 
Val Ser Thr Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly 
420 425 430 



gee gac acc aag aag cag gac tga 
Ala Asp Thr Lys Lys Gin Asp 

435 440 
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<210> 28 
<211> 439 
<212> PRT 

<213> Thraustochytriuia 
<400> 28 

Met Gly Lys Gly Ser Glu Gly Arg Ser Ala Ala Arg Glu Met Thr Ala 
15 10 " 

Glu Ala Asn Gly Asp Lys Arg Lys Thr He Leu He Glu Gly val Leu 
20 25 30 

Tyr Asp Ala Thr Asn Phe Lys His Pro Gly Gly Ser He He Asn Phe 
35 40 45 

Leu Thr Glu Gly Glu Ala Gly Val Asp Ala Thr Gin Ala Tyr Arg Glu 
50 55 60 

Phe His Gin Arg Ser Gly Lys Ala Asp Lys Tyr Leu Lys Ser Leu Pro 
65 70 75 80 

Lys Leu Asp Ala Ser Lys Val Glu Ser Arg Phe Ser Ala Lys Glu Gin 

85 90 95 

Ala Arg Arg Asp Ala Met Thr Arg Asp Tyr Ala Ala Phe Arg Glu Glu 
100 105 HO 

Leu Val Ala Glu Gly Tyr Phe Asp Pro Ser He Pro His Met He Tyr 
115 120 125 

Arg Val val Glu He val Ala Leu Phe Ala Leu Ser Phe Trp Leu Met 
130 135 140 

Ser Lys Ala Ser Pro Thr Ser Leu Val Leu Gly Val Val Met Asn Gly 
145 150 155 160 



He Ala Gin Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly 
165 170 175 
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Ser Phe Thr Gly Val lie Trp Leu Asp Asp Arg Met Cys Glu Phe Phe 
180 185 190 

Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gin His 
195 200 205 

Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val Asp Leu 
210 215 220 

Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val Arg Lys Val 
225 230 235 240 

Lys Pro Gly Ser Leu Leu Ala Leu Trp Leu Arg Val Gin Ala Tyr Leu 
245 250 255 

Phe Ala Pro Val Ser Cys Leu Leu lie Gly Leu Gly Trp Thr Leu Tyr 
260 " 265 270 

Leu His Pro Arg Tyr Met. Leu Arg Thr Lys Arg His Met Glu Phe Val 
275 280 . 285 

Trp lie Phe Ala Arg Tyr lie Gly Trp Phe Ser Leu Met Gly Ala Leu 
290 295 . 300 

Gly Tyr Ser Pro Gly Thr Ser Val Gly Met Tyr Leu Cys Ser Phe Gly 
305 310 315 320 

Leu Gly Cys lie Tyr He Phe Leu Gin Phe Ala Val Ser His Thr His 
325 330 335 

Leu Pro val Thr Asn Pro Glu Asp Gin Leu His Trp Leu Glu Tyr Ala 
340 345 350 

Ala Asp His Thr Val Asn He Ser Thr Lys Ser Trp Leu Val Thr Trp 
355 360 365 



Trp Met Ser Asn Leu Asn Phe Gin He Glu His His Leu Phe Pro Thr 
370 375 380 
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Ala Pro Gin Phe Arg Phe Lys Glu lie Ser Pro Arg Val Glu Ala Leu 
385 390 395 400 

Phe Lys Arg His Asn Leu Pro Tyr Tyr Asp Leu Pro Tyr Thr Ser Ala 
405 410 415 

Val Ser Thr Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly 
420 425 430 

Ala Asp Thr Lys Lys Gin Asp 
435 



<210> 29 
<211> 957 
<212> DNA 

<213> Mortierella alpina 

<220> 

<221> CDS 

<222> (1). -(957) 

<223> A6-elongase 

<400> 29 

atg gag teg att gcg cca ttc etc cca tea aag atg ccg caa gat ctg 48 

Met Glu Ser He Ala Pro Phe Leu Pro Ser Lys Met Pro Gin Asp Leu 
15 10 15 

ttt atg gac ctt gec acc get ate ggt gtc egg gec gcg ccc tat gtc 96 
Phe Met Asp Leu Ala Thr Ala He Gly Val Arg Ala Ala Pro Tyr Val 
20 25 30 

gat cct etc gag gee gcg ctg gtg gec cag gec gag aag tac ate ccc 144 
Asp Pro Leu Glu Ala Ala Leu Val Ala Gin Ala Glu Lys Tyr He Pro 
35 40 45 

acg att gtc cat cac acg cgt ggg ttc ctg gtc gcg gtg gag teg cct 192 
Thr He Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro 
50 55 60 
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ttg gcc cgt gag ctg ccg ttg atg aac ccg ttc cac gVg Ctg ttg ate 240 
Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu lie 
65 70 75 80 

gtg etc get tat ttg gtc acg gtc ttt gtg ggc atg cag ate atg aag 288 
Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gin lie Met Lys 

85 90 95 

aac ttt gag egg ttc gag gtc aag acg ttt teg etc ctg cac aac ttt 336 
Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe 
100 105 110 

tgt ctg gtc teg ate age gcc tac atg tgc ggt ggg ate ctg tac gag 384 
Cys Leu Val Ser He Ser Ala Tyr Met Cys Gly Gly He Leu Tyr Glu 
115 120 125 

get tat cag gcc aac tat gga ctg ttt gag aac get get gat cat acc 432 
Ala Tyr Gin Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr 
130 135 140 

ttc aag ggt ctt cct atg gcc aag atg ate tgg etc ttc~tac ttc tec 480 
Phe Lys Gly Leu Pro Met Ala Lys Met He Trp Leu Phe Tyr Phe Ser 
145 150 155 160 

aag ate atg gag ttt gtc gac acc atg ate atg gtc etc aag aag aac 528 
Lys He Met Glu Phe Val Asp Thr Met He Met Val Leu Lys Lys Asn 
165 170 175 

aac cgc cag ate tec ttc ttg cac gtt tac cac cac age tec ate ttc 576 
Asn Arg Gin He Ser Phe Leu His Val Tyr His His Ser Ser He Phe 
180 185 190 

acc ate tgg tgg ttg gtc acc ttt gtt gca ccc aac ggt gaa gcc tac 624 
Thr He Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr 
195 200 205 



ttc tct get gcg ttg aac teg ttc ate cat gtg ate atg tac ggc tac 
Phe Ser Ala Ala Leu Asn Ser Phe He His Val He Met Tyr Gly Tyr 
210 215 220 



672 
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tac ttc ttg teg gec ttg ggc ttc aag cag gtg teg ttc ate aag ttc 72 0 
Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gin Val Ser Phe lie Lys Phe 
225 230 235 240 

tac ate acg cgc teg cag atg aca cag ttc tgc atg atg teg gtc cag 768 
Tyr lie Thr Arg Ser Gin Met Thr Gin Phe Cys Met Met Ser Val Gin 
245 250 255 

tct tec tgg gac atg tac gee atg aag gtc ctt ggc cgc ccc gga tac 816 
Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr 
260 265 270 

ccc ttc ttc ate acg get ctg ctt tgg ttc tac atg tgg ace atg etc 8 64 
Pro Phe Phe lie Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu 
275 280 285 

ggt etc ttc tac aac ttt tac aga aag aac gee aag ttg gec aag cag 912 
Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gin 
290 295 300 

gec aag gec gac get gee aag gag aag gca agg aag ttg cag taa 957 
Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Arg Lys Leu Gin 
305 310 315 



<210> 30 
<211> 3X8 
<212> PRT 

<213> Mortierella alpina 
<400> 30 

Met Glu Ser lie Ala Pro Phe Leu Pro Ser Lys Met Pro Gin Asp Leu 
15 10 15 

Phe Met Asp Leu Ala Thr Ala lie Gly Val Arg Ala Ala Pro Tyr Val 
20 25 30 

Asp Pro Leu Glu Ala Ala Leu Val Ala Gin Ala Glu Lys Tyr lie Pro 
35 40 45 
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Thr lie Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro 
50 55 60 

Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu lie 
65 70 75 80 

Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gin lie Met Lys 

85 90 95 

Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe 
100 105 110 



Cys Leu Val Ser lie Ser Ala Tyr Met Cys Gly Gly lie Leu Tyr Glu 
115 120 125 

Ala Tyr Gin Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr 

* 130 - 135 - 140 1 ~ 

Phe Lys Gly Leu Pro Met Ala Lys Met lie Trp Leu Phe Tyr Phe Ser 
145 150 155 160 

Lys lie Met Glu Phe Val Asp Thr Met lie Met Val Leu Lys Lys Asn 
165 170 175 

Asn Arg Gin lie Ser Phe Leu His Val Tyr His His Ser Ser lie Phe 
180 185 190 

Thr lie Trp Trp Leu Val Thr Phe Val Ala Pro Asn Gly Glu Ala Tyr 
195 200 205 

Phe Ser Ala Ala Leu Asn Ser Phe lie His Val lie Met Tyr Gly Tyr 
210 215 220 

Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gin Val Ser Phe He Lys Phe 
225 230 235 240 

Tyr He Thr Arg Ser Gin Met Thr Gin Phe Cys Met Met Ser Val Gin 
245 250 255 

Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr 
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Pro Phe Phe He Thr Ala Leu Leu 
275 280 

Gly Leu Phe Tyr Asn Phe Tyr Arg 
290 295 

Ala Lys Ala Asp Ala Ala Lys Glu 
305 310 



80 

265 * 270 

Trp Phe Tyr Met Trp Thr Met Leu 
285 

Lys Asn Ala Lys Leu Ala Lys Gin 
300 

Lys Ala Arg Lys Leu Gin 
315 



<210> 31 
<211> 1374 
<212> DNA 

<213> Mortierella alpina 

<220> 

<221> CDS 

<222> (1)..(1374) 

<223> A6-desaturase 



<400> 31 

atg get get get ccc agt gtg agg 
Met Ala Ala Ala Pro Ser Val Arg 
1 5 

aat gec gag get ctg aat gag ggc 
Asn Ala Glu Ala Leu Asn Glu Gly 
20 



acg ttt act egg gee. gag gtt ttg 4 8 
Thr Phe Thr Arg Ala Glu Val Leu 
10 is 

aag aag gat gec gag gca ccc ttc 9 6 
Lys Lys Asp Ala Glu Ala Pro Phe 
25 30 



ttg atg ate ate gac aac aag gtg tac gat gtt cgc gag ttc gtc cct 144 
Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 



gat cat ccc ggt gga agt gtg att etc acg cac gtt ggc aag gac ggc 
Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 



192 



gac gtc ttt gac act ttt cac ccc gag get get tgg gag act ctt 240 
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Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Olu Thr Leu 
65 70 75 80 

gcc aac ttt tac gtt ggt gat att gac gag age gac cgc gat ate aag 288 
Ala Asn Phe Tyr Val Gly Asp lie Asp Glu Ser Asp Arg Asp lie Lys 

85 90 95 

aat gat gac ttt gcg gcc gag gtc cgc aag ctg cgt acc ttg ttc cag 336 
Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 
100 105 110 

tct ctt ggt tac tac gat tct tec aag gca tac tac gcc ttc aag gtc 384 
Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

teg ttc aac etc tgc ate tgg ggt ttg teg acg gtc att gtg gcc aag 432 
Ser Phe Asn Leu Cys lie Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

tgg ggc cag acc teg acc etc gcc aac gtg etc teg get gcg ctt ttg 480 
Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 150 155 - - 160 

ggt ctg ttc tgg cag cag tgc gga tgg ttg get cac gac ttt ttg cat 52 8 
Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

cac cag gtc ttc cag gac cgt ttc tgg ggt gat ctt ttc ggc gcc ttc 57 6 
His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
180 185 190 

ttg gga ggt gtc tgc cag ggc ttc teg tec teg tgg tgg aag gac aag 62 4 
Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

cac aac act cac cac gcc gcc ccc aac gtc cac ggc gag gat ccc gac 67 2 
His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 



att gac acc cac cct ctg ttg acc tgg agt gag cat gcg ttg gag atg 



720 
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lie Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

ttc teg gat gtc cca gat gag gag ctg acc cgc atg tgg teg cgt ttc 
Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 255 

atg gtc ctg aac cag acc tgg ttt tac ttc ccc att etc teg ttt gec 
Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 



768 



816 



cgt etc tec tgg tgc etc cag tec att etc ttt gtg ctg cct aac ggt 8 64 
Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 



cag gec cac aag ccc teg ggc gcg cgt 
Gin Ala His Lys Pro Ser Gly Ala Arg 
290 295 

cag ctg teg ctt gcg atg cac tgg acc 
Gin Leu Ser Leu Ala Met His Trp Thr 
305 310 

ctg ttc ate aag gat ccc gtc aac atg 
Leu Phe He Lys Asp Pro Val Asn Met 
325 

cag gcg gtg tgc gga aac ttg ttg gcg 
Gin Ala Val Cys Gly Asn Leu Leu Ala 
340 345 



gtg ccc ate teg ttg gtc gag 912 
Val Pro He Ser Leu Val Glu 
300 

tgg tac etc gec acc atg ttc 960 
Trp Tyr Leu Ala Thr Met Phe 
315 320 

ctg gtg tac ttt ttg gtg teg 1008 
Leu Val Tyr Phe Leu Val Ser 
330 335 

ate gtg ttc teg etc aac cac 1056 
He Val Phe Ser Leu Asn His 
350 



aac ggt atg cct gtg ate 

Asn Gly Met Pro Val He 
355 

ttc acg aag cag ate ate 

Phe Thr Lys Gin He He 
370 

gee aac tgg ttc acg ggt 



teg aag gag gag gcg gtc 
Ser Lys Glu Glu Ala Val 
360 

acg ggt cgt gat gtc cac 
Thr Gly Arg Asp Val His 
375 380 

gga ttg aac tat cag ate 



gat atg gat ttc 1104 

Asp Met Asp Phe 

365 

ccg ggt eta ttt 1152 
Pro Gly Leu Phe 

gag cac cac ttg 1200 
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Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin lie Glu His His Leu 
385 390 395 400 

ttc cct teg atg cct cgc cac aac ttt tea aag ate cag cct get gtc 124 8 
Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys He Gin Pro Ala Val 
405 410 415 

gag ace ctg tgc aaa aag tac aat gtc cga tac cac acc acc ggt atg 12 96 
Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
420 425 430 

ate gag gga act gca gag gtc ttt age cgt ctg aac gag gtc tec aag 1344 
lite Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

get gee tec aag atg ggt aag gcg cag taa 1374 
Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 



<210> 32 
<211> 457 
<212> PRT 

<213> Mortierella alpina 
<400> 32 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
1 5 10 15 

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
20 25 30 

Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 



Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 70 75 80 
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Ala Asn Phe Tyr Val Gly Asp lie Asp Glu Ser Asp Arg Asp lie Lys 
85 90 95 

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 
100 105 110 

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 150 155 * 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His .Asp Phe Leu His 
165 170 175 

His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
180 185 190 

Leu Gly Gly Val Cys Gin Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 

He Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 255 

Met val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 



Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 
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Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro lie Ser Leu Val Glu 
290 295 300 

Gin Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305 310 315 320 

Leu Phe lie Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
325 330 335 

Gin Ala Val Cys Gly Asn Leu Leu Ala lie Val Phe Ser Leu Asn His 
340 345 350 

Asn Gly Met Pro Val lie Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
355 360 365 

Phe Thr Lys Gin lie Tie Thr Gly Arg Asp Val His Pro Gly Leu Phe 

. 370 375 380 

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gin lie Glu His His Leu 

385 390 395 400 

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys lie Gin Pro Ala Val 
405 410 415 

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
420 425 430 

lie Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
435 440 445 

Ala Ala Ser Lys Met Gly Lys Ala Gin 
450 455 



<210> 33 
<211> 3598 
<212> DNA 
<213> Unknown 



<220> 
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<223> Sequence constitutes a plant 

promoter-terminator expression cassette in vector 

pUC19 
<400> 33 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 2 40 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa. tttgcaagtt gattaattga ttctaaatta £00 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 108 0 
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taatttcttc a-tagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcc/t 1140 
gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt: 1200 
gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 12 60 
tctaatgaat a-tatcacccg t/tactatcgt atttttatga ataatattct ccgttcaatt 1320 
tactgattgt ccgtcgacga attcgagctc ggcgcgccaa gcttggcgta atcatggtca 1380 
tagctgtttc c-tgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 1440 
agcataaag-t g*taaagcct.g gggtgcctaa tgagtgagct aactcacatt aattgcgttg 1500 
cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 1560 
caacgcgcgg ggagaggcgg -tttgcgtatt gggcgctctt. ccgcttcctc gctcactgac 1620 
tcgctgcgc-t cggtcg-ttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata 1680 
cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa 1740 
aaggccagga accgtaaaaa ggccgcgttg. ctggcgttt.t tcca-taqgct: ccgcccccct 1800 
gacgagcatc acaaaaatcg acgctcaagt. cagaggtggc gaaacccgac aggactataa 1860 
agataccagg cgtttccccc tggaagctcc c-tcgtgcgct ctcctgttcc gaccctgccg 1920 
cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgc-tttc tcatagctca 1980 
cgctgtaggt atctcagtitc ggtgtaggtc gttcgctcca agc-tgggctg tgtgcacgaa 2040 
ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtc-fc-tga gtccaacccg 2100 
gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 2160 
tatgtaggcg gtgctacaga gttcttgaag tgg-tggccta actacggcta cactagaagg 2220 
acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2280 
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tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 2340 
attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 2400 
gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 24 6 0 
ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 252 0 
taaacttggi: ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt. 2580 
ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 2640 
ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 2700 
gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact. 2760 
ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 282 0 
gttaatagtt tgcgcaacgt. tgttgccatt gctacaggca tcgtggtgtc acgctcgteg 2880 
tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 294 0 
atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aag-taagttg 3000 
gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 3060 
tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 3120 
atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3180 
agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 3240 
ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 3300 
tct-tttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 3360 
aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 3420 
tgaagcattt atcagggtta -ttgtctcatg agcggataca tatttgaatg tatttagaaa 3 4 80 
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aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 3540 
accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 3598 

<210> 34 

<211> 3590 

<212> DNA 

<213> Unknown 

<220> 

<2'23> Sequence constitutes a plant 

promoter-terminator expression cassette in vector 
pUC19 

<400> 34 

tcgcgcgttt cggtgatgac ggtgaaaacc _ tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 - 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 18 0 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 
tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 6 00 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 6 60 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
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gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat g-ttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 
taatttcttc atagccagcg gatccgatat cgggcccgct agcgttaacc ctgctttaat 114 0 
gagatatgcg agacgcctat gatcgcatga tatttgcttt caa-btctgtt gtgcacgttg 12 00 
taaaaaacct gagcatg-tg-t agctcagatc cttaccgccg g-tttcggttc attctaatga 1260 
atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tt/tactgatt 1320 
gtccgtcgac gaattcgagc tcggcgcgcc aagcttggcg taatcatggt catagctgtt. 1380 
tcctgtgtga aattgttatc cgctcacaat tccacacaac atacgagccg- gaagcataaa -1440 
gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca ttaattgcgt tgcgctcact 150 0 
gcccgct-ttc cagtcgggaa acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc 156 0 
ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg 1620 
ctcggtcgtt cggctgcggc gagcggtatc agctcactca aaggcggtaa tacggttatc 1680 
cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc aaaaggccag 1740 
gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg ctccgccccc ctgacgagca 1800 
tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg acaggactat aaagatacca 1860 
ggcgtttccc cctggaagct ccctcgt.gcg ctctcctgtt ccgaccctgc cgcttaccgg 1920 
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atacctgtcc gcctttct.cc cttcgggaag cgtggcgctt tctcatagct cacgctgtag 19 80 

gtatctcagt tcggtgtagg tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt 2 04 0 

tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt gagtccaacc cggtaagaca 2100 

cgacttatcg ccactggcag cagccactgg taacaggatt agcagagcga ggtatgtagg 2160 

cggtgctaca gagttcttga agtggtggcc taactacggc tacactagaa ggacagtatt 2 220 

tggtatctgc gctctgctga agccagttac cttcggaaaa agagttggta gctcttgatc 2 2 80 

cggcaaacaa accaccgctg gtagcggtgg tttttttgtt tgcaagcagc agattacgcg 2340 

cagaaaaaaa. ggatctcaag aagatccttt gatcttttct acggggtctg acgctcagtg 2400 

gaacgaaaac tcacgttaag ggattttggt catgagatta tcaaaaagga tcttcaccta 2460 

<jatcctttta aattaaaaat gaagttttaa afcaatcta'a "agtatatatg agtaaacttg^2 52 0 

gtctgacagt taccaatgct taatcagtga ggcacctatc tcagcgatct gtctatttcg '2580 

ttcatccata gttgcctgac tccccgtcgt. gtagataact acgatacggg agggcttacc 2640 

atctggcccc agtgctgcaa tgataccgcg agacccacgc tcaccggctc cagatttatc 2 7 00 

agcaataaac cagccagccg gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc 2760 

ctccatccag tctattaatt gttgccggga agctagagta agtagttcgc cagttaatag 2820 

tttgcgcaac gttgttgcca ttgctacagg catcgtggtg tcacgctcgt cgtttggtat 2880 

ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 29 40 

caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 3 000 

gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 3 0 60 
atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 3120 
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accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 3180 
aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 3240 
gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 3300 
tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 3360 
aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 3420 
ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 3480 
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat 3540 
tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 3590 

<210> 35 

<211>_ 3564 - ' Vr v .. ■■ : ~ 

<212> DNA 
<213> Unknown 

<220> 

<223> Sequence constitutes a plant 

promoter-terminator expression cassette in vector 
pUC19 

<400> 35 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 2 40 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 3 60 
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fttcccagtc acgacgt/tgt aaaacgacgg ccagtgaat/t cggcgcgccg agctcc-tcga 4 20 
gcaaa-tttac acat"tgccac taaacgtcta aaccct-tgta at-ttg-t-tt-tt gt-tttactat 4 80 
gtgtgttatg tatttgattt. gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa -tttgcaagtt gattaattga ttctaaafta 600 
tttttgtctt, ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 6 60 
tacta-tagga gaattaaagt gag-tgaatat ggtaccacaa gg-tttggaga tt-taattgtt, 7 20 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataa-taatgg 7 80 
taccacacaa gatt-tgaggt gcatgaacgt cacgtggaca aaagg-tttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tg-tggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataa-tgaaga aaactacaaa* tttacatgca actagttatg cat.gt:agtct: 1020 
atataatgag gattttgcaa tactttcatt . catacacact cactaagttt: tacacgatta 1080 
taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat. 1140 
gagatatgcg agacgcctat. gatcgcatga tatttgcttt: caattctgtt gtgcacgttg 1200 
taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attc-taatga 1260 
atatatcacc cgttactatc gtatttttat. gaataatatt ctccgttcaa tttactgatt 1320 
gtccgtcgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380 
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 
agcctggggt gcctaatgag tgagctaact. cacattaatt gcgt/tgcgct cactgcccgc 1500 
tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560 
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aggcggtttg cgtattgggc gctcttccgc 
cgttcggctg cggcgagcgg tatcagctca 
atcaggggat aacgcaggaa agaacatgtg 
taaaaaggcc gcgttgctgg cgtttttcca 
aaatcgacgc tcaagtcaga ggtggcgaaa 
tccccctgga agcrtccctcg tgcgctctcc 
gtccgccttt ctcccttcgg gaagcgtggc 
cagttcggtg taggtcgttc gctccaagct 
cgaccgctgc gccttatccg gtaactatcg 
atcgccactg gcagcagcca ctggtaacag 
tacagagttc ttgaagtggt ggcctaacta 
ctgcgctctg ctgaagccag ttaccttcgg 
acaaaccacc gctggtagcg gtggt.tt.ttt 
aaaaggatct caagaagatc ctttgatctt 
aaactcacgt taagggattt tggtcatgag 
tttaaattaa aaatgaagtt ttaaatcaat 
cagttaccaa tgcttaatca gtgaggcacc 
catagttgcc tgactccccg tcgtgtagat 
ccccagtgct gcaatgatac cgcgagaccc 
aaaccagcca gccggaaggg ccgagcgcag 
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ttcctcgctc actgactcgc tgcgctcggt 1620 
ctcaaaggcg gtaatacggt tatccacaga 1680 
agcaaaaggc cagcaaaagg ccaggaaccg 1740 
taggctccgc ccccctgacg agcatcacaa 1800 
cccgacagga ctataaagat accaggcgtt 1860 
tgttccgacc ctgccgctta ccggatacct 1920 
gctttctcat agctcacgct gtaggtatct 1980 
gggctgtgtg cacgaacccc ccgttcagcc 2040 
tcttgagtcc aacccggtaa gacacgactt 2100 
gattagcaga gcgaggtatg taggcggtgc 2160 
cggctacact agaaggacag tatttggtat 2220 
aaaaagagtt ggtagctctt gatccggcaa 22 80 
tgtttgcaag cagcagatta cgcgcagaaa 2340 
ttctacgggg tctgacgctc agtggaacga 2400 
attatcaaaa aggatcttca cctagatcct 2460 
ctaaagtata tatgagtaaa cttggtctga 2520 
tatctcagcg atctgtctat ttcgttcatc 2580 
aactacgata cgggagggct taccatctgg 2 640 
acgctcaccg gctccagatt tatcagcaat 2700 
aagtggtcct gcaactttat ccgcctccat 2 760 
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ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2 820 

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2 880 

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2 940 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctt.taaaagt 3180 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 32 4 0 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3 30 0 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360 

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420 
gggttattgt ctcatgagcg gatacatatt - tgaatgtatt tagaaaaata aacaaatagg 3480 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3540 

gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtc 3584 



<210> 36 
<211> 4507 
<212> DNA 
<213> Unknown 

<220> 

<2 2 3> Sequence constitutes a plant 

promoter-terminator expression cassette in vector 
pUC19 
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<400> 36 

tcgcgcgttt cggtga-tgac gg-tgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagag-tgc 180 
accatatgcg gtgtgaaata ccgcacagat: gcgtaaggag aaaataccgc atcaggcgcc 240 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 
tacgccagct. ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgtt.gt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgt.gt.tatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 54 0 
t-ttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaa-bta 600 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg crtaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacg-t cacgtggaca aaaggt/ttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 9 00 
ttaaaaatat tttggaaatg atttgcatgg aagccatg-tg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 
taatttc-ttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140 
gct/ttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt. 1200 
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gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat. 1260 
tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 
tactgattgt ccgtcgagca aa-tttacaca ttgccactaa acgtctaaac ccrttgtaatt 13 80 
tgttt/ttgtt ttactatgtg tgttatgtat ttgatttgcg ataaattttt atatttggta 1440 
ctaaatttat aacacctttt atgctaacgt ttgccaacac ttagcaattt gcaagttgat, 1500 
taattgattc taaattattt ttgtcttcta aatacatata ctaatcaact ggaaatgtaa 1560 
atatttgcta atatttctac tataggagaa ttaaagtgag tgaatatggt accacaaggt 162 0 
ttggagattt aa-t-tgttgca atgctgcatg gatggcatat acaccaaaca ttcaataatt 1680 
cttgaggata ataatggtac cacacaagat. ttgaggtgca tgaacgtcac gtggacaaaa 1740 
ggtttagtaa -ttt.t/tcaaga- caacaatgtt accacacaca. agttttgagg tgcatgcafcg 1800 
gatgccctgt ggaaagttta aaaatatttt. ggaaatga-tt tgcatggaag ccatgtgtaa 1860 
aaccatgaca tccacttgga ggatgcaata - atgaagaaaa ctacaaattt acatgcaact 1920 
ag-ttatgcat gtagtctata taatgaggat tttgcaatac tttcattcat acacactcac 1980 
taagttttac acgattataa tttc-ttcata gccagcggat ccgatatcgg gcccgctagc 2 04 0 
gttaaccctg ctttaatgag atatgcgaga cgcctatgat cgcatgatat ttgctttcaa 2100 
ttctgttgtg cacgttgtaa aaaacctgag catgtgtagc tcagatcctt accgccggtt 216 0 
tcggttcatt ctaatgaata tatcacccgt. tactatcgta tttttatgaa taataftctc 222 0 
cgt-tcaattt actgattgtc cgtcgacgaa ttcgagctcg gcgcgccaag ct-tggcgtaa 2280 
tcatggtcat agctgtttcc tg-tgtgaaat tgttatccgc tcacaattcc acacaacata 2340 
cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 2400 
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attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 24 60 
tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 2520 
ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 2580 
gcggtaa-tac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 2640 
ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgt-tttt ccataggctc 2700 
cgcccccctg acgagcatca caaaaatcga cgct.caagtc agaggtggcg aaacccgaca 2760 
ggactataaa gataccaggc gtttccccct ggaagctccc tcg-tgcgctc tcctgttccg 2820 
accctgccgc ttaccggata cctgtccgcc -tttctccctt cgggaagcgt ggcgctttct 2 880 
catagct-cac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 2940 
gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 3000 
tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 3060 
agagcgaggt atgtaggcgg tgctacagag_ ttcttgaagt ggtggcctaa ctacggctac -3120 
actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 3180 
gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt tttt.gtt-tgc 3240 
aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 3300 
gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 3360 
aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 3420 
atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 3480 
gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 3540 
atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 3600 
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ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 3660 

cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 3720 

agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 3780 

cgctcgtcgt ttggtatggc ttcattcagc -tccggttccc aacgatcaag gcgagttaca 3840 

tgatccccca tg-ttgtgcaa aaaagcggtt agctccttcg gtcc-tccgat cgttgtcaga 3900 

agtaagt-tgg ccgcagtg-tt atcactcatg gttatggcag cactgcataa ttctcttact 3960 

gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 402 0 

gaat.agt.gta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 4080 

ccacatagca gaactt/taaa agtgctcatc a-ttggaaaac gttcttcggg gcgaaaactc 4140 

tcaaggatct taccgc-tg-tt gagatccagt tcgatg-taac ccactcgtgc acccaactga 4200 

tcttcagcat cttttacttt caccagcgtt. tctggg-tgag caaaaacagg aaggcaaaat 4260 

gccgcaaaaa agggaataag ggcgacacgg. aaatgttgaa tactcatact cttccttttt 4320 

caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 4380 

atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 4440 

gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 4500 

tttcgtc 4507 



<210> 37 
<211> 5410 
<212> DNA 
<213> Unknown 



<220> 
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<223> Sequence constitutes a plant 

promoter-terminator expression cassette in vector 
pUCl9 

<400> 37 

ttttggaaat gatttgcatg gaagccatgt gtaaaaccat gacatccact tggaggatgc 60 
aataatgaag aaaactacaa atttacatgc aactagttat gcatgtagtc tatataatga 120 
ggattttgca atactttcat tcatacacac tcactaagtt ttacacgatt ataatttctt 180 
catagccagc ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc 24 0 
gagacgccta tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc 300 
tgagcatgtg tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac 360 
ccgttactat cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600' 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 
ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 
atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 
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taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat. 1140 

gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 12 00 

taaaaaacct. gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 12 60 

atatatcacc cgttactatc gtatttttat gaataatatt ctccg-ttcaa tttactgatt 1320 

gt.ccgt.cgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgttt.cctgt. 1380 

g-tgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 

agcctggggt gcctaatgag tgagctaact. cacattaatt gcgttgcgct cactgcccgc 1500 

tttccagtcg ggaaacctgt cgtgccagct gcatt.aat.ga atcggccaac gcgcggggag 1560 

aggcggtttg cgtat-tgggc gctcttccgc ttcctcgctc actgactcgc fcgcgctcggt. 1620 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 

tccccctgga agctccctcg -tgcgctctcc tg-ttccgacc ctgccgctta ccggatacct: 1920 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat. agctcacgct gtaggtatct 1980 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 

tacagagt-tc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat. 2220 

ctgcgctctg ctgaagccag ftaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280 
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acaaaccacc gctggtagcg gtggttt/ttt tgtttgcaag cagcagatta cgcgcagaaa 2340 
aaaaggatc-t caagaagatc ctt-tgatctt ttctacgggg tctgacgctc agtggaacga 2400 
aaactcacg-t taagggattt: tggtcatgag attatcaaaa aggatcttca cctagatcct 2460 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtc-tga 2520 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg at.ctgtctat -ttcgt/tcatc 2580 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2640 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactt-tat ccgcctccat 2760 
ccagtctatt. aattgt/tgcc gggaagctag agtaagtagt tcgccagtta atagtt-tgcg 2820 
caacg-ttgt-t gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880 
attcagctcc ggttcccaac gatcaaggcg agttacatga t.ccccca-tgt tgtgcaaaaa 2 940 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000 
actcatggt-t atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3 060 
tt-ctgtgact ggtgagtact caaccaagtc attctgagaa tagtgta-tgc ggcgaccgag 3120 
ttgctcttgc ccggcg-tcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 31B0 
gctcatcaft ggaaaacgt-t cttcggggcg aaaactctca aggatcttac cgctgttgag 3240 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3 300 
cagcg-tttc-t gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 3360 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420 
gggttattg-t ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480 
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gg-ttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3 540 

gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc gtttcggtga 3600 

tgacggtgaa aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc 3660 

ggatgccggg agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg 3720 

ctggcttaac tatgcggcat cagagcagat tgtactgaga gtgcaccata tgcggtgtga 3780 

aataccgcac agatgcgtaa ggagaaaata ccgcatcagg cgccattcgc cattcaggct 3840 

gcgcaactg-t tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa 3900 

agggggatg-t gctgcaaggc gattaagt*tg gg-taacgcca gggttt/tccc agtcacgacg 3960 

ttgtaaaacg acggccagtg aattcggcgc gccgagctcc tcgagcaaat-ttacacattg 4020 

ccactaaacg .tctaaaccc.t ' tgtaatttgt -ttt-tg-tttta ctatg-tg-tgt tatgtatttg 4080 

atttgcgata aatttttata tttggtacta aatttataac accttttatg ctaacgtttg 4140 

ccaacactta gcaatttgca agttgattaa. ttgattctaa attatttttg tc-ttcrtaaat. 4200 

acatatacta atcaactgga aatgtaaata tttgctaata tttctactat aggagaatta 4260 

aagtgagtga atatggtacc acaaggtttg gagatttaat tgttgcaatg ctgca-tggat. 4320 

ggcatataca ccaaacattc aataattctt gaggataata atggtaccac acaagatttg 4380 

aggtgcatga acgtcacgtg gacaaaaggt ttagtaattt ttcaagacaa caatgttacc 4440 

acacacaagt fttgaggtgc atgcatggat gccctgtgga aagtttaaaa atatt*t*tgga 4500 

aatgatttgc atggaagcca tgtgtaaaac catgacatcc acttggagga tgcaataatg 4560 

aagaaaacta caaatttaca tgcaactagt tatgcatgta gtctatataa tgaggatt/t/t 4620 

gcaatacttt cattcataca cactcactaa gttttacacg attataattt. cttcatagcc 4680 
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agcccaccgc ggtgggcggc cgcctgcagt ctagaaggcc tcctgcttta atgagatatg 4740 
cgagacgcct atgatcgcat gatatttgct ttcaattctg ttgtgcacgt tgtaaaaaac 4800 
ctgagcatgt gtagctcaga tccttaccgc cggtttcggt tcattctaat gaatatatca 4860 
cccgttacta tcgtattttt atgaataata ttctccgttc aatttactga ttgtccgtcg 4920 
agcaaattta cacattgcca ctaaacgtct aaacccttgt aatttgtttt tgttttacta 4 9 80 
tgtgtgttat gtatttgatt tgcgataaat ttttatattt ggtactaaat ttataacacc 5040 
ttttatgcta acgtttgcca acacttagca atttgcaagt tgattaattg attctaaatt 5100 
atttttgtct tctaaataca tatactaatc aactggaaat gtaaatattt gctaatattt 5160 
ctactatagg agaattaaag tgagtgaata tggtaccaca aggtttggag atttaattgt 5220 
tgcaatgctg catggatggc atatacacca aacattcaat aattcttgag gataataatg 5280 
gtaccacaca agatttgagg tgcatgaacg tcacgtggac aaaaggttta gtaatttttc 5340 
aagacaacaa tgttaccaca cacaagtttt gaggtgcatg catggatgcc ctgtggaaag 5400 
tttaaaaata 5410 

<210> 38 
<211> 12093 
<212> DNA 
<213> Unknown 

*<220> 

<223> Plant expression vector with a 

promoter-terminator expression cassette 

<400> 38 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
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gcgcccagca caggtgcgca 
tagtgggcgg tgacgtcgtt. 
ataatcaggc cgatgccgac 
atgttgggtt tcacgtctgg 
ttctttatca ctgataag-tt 
tgacaaagtt gcagccgaat 
gcgtagacgg tctgacgaca 
tttactggca cttcaggaac 
cggagaatca tacgcattcg 
ggaatgcccg cagcttcagg 
ccggcacgcg accgggcgca 
gcgaggcggg tttttcggcc 
ctgttggggc cgtgcttgag 
ccgttgaaca ggctccgc-tc 
ccggtccgga cgcagcgt*tc 
ggaggcrtcgt tgtcaggaac 
tgccggagcg caacccactc 
ccaccgcgtc agacgcccgt 
ccaagcctca cggccgcgc-t 
gctcactgac tcgc-tgcgct 
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ggcaaattgc accaacgcat 
cgagtgaacc agatcgcgca 
agcgtcgagc gcgacagtgc 
cctccggacc agcctccgct 
gg-tggacata ttatgtttat 
acagtgatcc gtgccgccct 
cgcaaactgg cggaacggtt 
aagcgggcgc tgctcgacgc 
gtgccgagag ccgacgacga 
caggcgctgc tcgcctaccg 
ccgcagatgg. aaacggccga 
ggggacgccg tcaatgcgct, 
gagcaggccg gcgacagcga 
tcgccgctgt tgcgggccgc 
gagcagggac tcgcggtgat 
gttgaaggac cgagaaaggg 
actacagcag agccatgtag 
agcagcccgc tacgggcttt 
cggcc-tc-tct ggcggcctztc 
cggtcgttcg gctgcggcga 



-28 

acagcgccag cagaatgcca 120 
ggaggcccgg cagcaccggc 180 
tcagaattac gatcaggggt 240 
ggtccgattg aacgcgcgga 300 
cagtgataaa gtgtcaagca 360 
ggacctgttg aacgaggtcg 42 0 
gggggttcag cagccggcgc 4 80 
ac-tggccgaa gccatgctgg 540 
ctggcgctca tttctgatcg 600 
cgatggcgcg cgcatccatg 6 60 
cgcgcagctt cgct/tcctct 720 
gatgacaatc agctactrtca 7 80 
tgccggcgag cgcggcggca 840 
gatagacgcc ttcgacgaag 900 
tg-tcgatgga ttggcgaaaa 960 
tgacga-ttga tcaggaccgc 1020 
acaacatccc ctcccccttt 1080 
ttcatgccct gccctagcgt 1140 
tggcgctctt, ccgcttcctc 1200 
gcggtatcag ctcactcaaa 1260 
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ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaatfa tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 156 0 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 162 0 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctt.tcct.tgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 192 0 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg^ ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
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cctcgcggaa aacrttggccc tcactgacag atgaggggcg gacgttgac'a cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct; cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgat/tt tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2 700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgt-t t/ttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg . cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggt.gccgggc agtgagggcg 3180 
gcggcctggg -tggcggcctg cccttcact-t cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt gg-tcgcgggt? gccgtgctcg 33 00 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 342 0 
ctaccaagac gaagaggatg aagaggatga ggaggcagat. tgccttgaat atattgacaa 348 0 
tac-tgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa . acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
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attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt. atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg crtcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacg-tgcgc aacaaccg-tc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatt/tc cgcgcagacg atgacgtcac 4 080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgt.tgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attt.tctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
tggfttcaaa atcggctccg tcgatactat gttatacgcc aact.ttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 620 
taaa-tggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct. aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcactt/t 4860 
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gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 492 0 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggag-tgcatc 498 0 
aggc-tctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 504 0 
ttagccgaat -tggattact-t actgaataac gatctggccg atgtggattg cgaaaactgg 510 0 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg atttt-ttaaa gacggaaaag 516 0 
cccgaagagg aacttgtctt t/tcccacggc gacctgggag acagcaacat ctt-tgtgaaa 5220 
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caag-tggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcc-t gattgggaga aaataaaata ttatatttta 5400 
.ctggatgaat tgttttagta cctagatgtg gcgcaacgat. gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtat-tt, 5520 
gggcaagggg tcgctgg-tat. tcgtgcaggg caagattcgg aa-taccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat. tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc ggg-tcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5 940 
ttcgcgtcgi: ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
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cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggHgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc gg-tgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6 780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6 960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7 02 0 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7 080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgntc gc-tgaacggt tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 
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aaggacgctc acaaggcgca tctgtccggc g-ttttcgtgg agcccgaac'a gcgaggccga 7320 
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt. tattgctcg*t gatgatcg*tc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7 44 0 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gc-taggtagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 762 0 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct. ggcgggggcg 7680 
gt-ttccatgg cgt/tcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccag-t agctttagtg 7800 
•tttgatccgc caa-tcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagntgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccgggga-tg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc crtat.tatg«tc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt, t.gccgtt.ct/t ccgaatagca tcggtaacat 83 4 0 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgt.cccgga ctgatgggct 84 00 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
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agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9B40 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct. g-ttgtgccca gtcatagccg aa*tagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca atccatcttg ttcaatccaa gc-tcccatgg gccctcgact. agagtcgaga 10200 

tctggatt.ga gagtgaata.t gagactctaa -ttggatacc.g- aggggaattt atggaacgtc 102 60 

agtggagcat^ttttgacaag „aaa-tatttgc. tagctgatag tgacc-ttagg cgacttttga 10320 

acgcgcaata atggtt-tctg acgtatgtgc ttagc-tcatt aaactccaga aacccgcggc 103 80 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag a-tcct/tggcg gcaagaaagc catccagt/tt. actttgcagg gcttcccaac 10560 

<rtt:accagag ggcgccccag ctggcaattc cggttcgctt. gctg-tccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

tt-ccct-tgtc cagatagccc agtagc-bgac at-tcatccgg ggtcagcacc gtttcbgcgg 10740 

actggctttc tacgtgttcc gct-tccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
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attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca. ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt' tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 
atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 
gtcgacgaat tcgagctcgg cgcgcctcta gaggatcgat gaattcagat cggctgagtg 11820 
gctccttcaa cgntgcggt* ctgtcacpttc caaacgtaaa acggcttgtc ccgcgtcatc 11880 
ggcgggggtc ataacgtgac tcccttaatt ctccgctcat gatcagattg tcgtttcccg 11940 
ccttcagttt aaactatcag tgtttgacag gatatattgg cgggtaaacc taagagaaaa 12000 
gagcgtttat tagaataatc ggatatttaa aagggcgtga aaaggtttat ccttcgtcca 12060 
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tttgtatgtg catgccaacc acagggttcc cca • 12093 

<210> 39 
<211> 12085 
<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with a 

promoter-terminator expression cassette 

<4'0 0> 3 9 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 42 0 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 54 0 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 84 0 
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ccgttgaaca ggctccgctc tcgccgctgt. tgcgggccgc gatagacgcc t/tcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga t/tggcgaaaa 960 
ggaggctcgt tg-tcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctct-t. ccgcttcctc 12 00 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 12 60 
ggcggtaata cgg-ttatcca cagaa-tcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgt/ttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtt/tccccc tggaagctcc ctcgtgcgc-t ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc, ctttctccct tcgggaagcg -tggcgctttt. 4560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtata-t ccatcctttt. 1620 
tcgcacgata tacaggattt tgccaaaggg ttcgtg*taga ct-ttccttgg tgtatccaac 1680 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc -tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtac-tgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctaccfcgct ggccgtcggc cagggctaca 19 80 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
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tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcgg-tgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagct/tg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgactt/ttt tagccgcrtaa 2220 
aacggccggg gggtgcgcgt gat/tgccaag cacgtcccca tgcgctccat caagaagagc 22 80 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 24 60 
cc-tcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt, tgacagatga ggggcaggct cga-tttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgat/tt tacgcgag-tt; tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat cct-tgacact tgaggggcag agtgctgaca gatgaggggc -2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt. cttttaacc-t gctt/ttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct. gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg crtaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttigcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcat/tcacg gacttcatgg 3240 
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cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 342 0 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 366 0 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 372 0 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcg* atatcgcttg ctgattacgt 3340 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cg-tcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga -3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4 080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 414 0 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4 320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
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tggtttcaaa atcggctccg tcgatactat gttatacgcc aactt/tgaaa acaactttga 4 5 00 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4 5 60 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 6 20 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 46 80 
gtaaaagata cggaaggaat. gtctcctgct aaggtatata agctggtggg agaaaatgaa 47 40 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4 8 00 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4 8 60 
gaacggcatg atggc-tggag caatctgctc a-tgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4 9 80 
aggctctt/tc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaafc tggat/tacrtt ^actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc \ gagctgtatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgt.ccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttacrtggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt. acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5 640 
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ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 57 00 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt. ggcagcaggt _ gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6 600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccg-tacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
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tgacctggtg cattgcaaac gctagggcct tgtggggtca g-ttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttxrgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7 200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga gga.cggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gt-tttcgtgg agcccgaaca gcgaggccga 7320 

ggggtcgccg gtatgctgct gcgggcgttg ccggcggg-tt tattgctcgt gatgatcgtc 7 380 

cgacagattc caaegggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 

-tcgctattct ggagcttgtt gtttat-ttcg gtctaccgcc tgccgggcgg gcftcgcggcg 7500 
acggtaggcg ctgtgcagcc gc-tgatgg-tc, gtgttca-tct ctgccgctct gc-taggtagc 7560 

ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7 62 0 

gtgttgacac caaacgcagc gctagatcct. gtcggcgtcg cagcgggcct ggcgggggcg 7 680 

gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 

acctttaccg cctggcaact. ggcggccgga ggacttctgc tcgttccagt. agctttagtg 7 80 0 

tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7 860 

ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct. 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
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aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtaticg agtggtgatt ttgtgccgag c-tgccggtcg gggagctgtt ggctggctgg 8460 
■tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgt.ttt.ta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg g-tttgcccca 8 640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt. tccagtttgg aacaagagtc cactattaaa -8 7 60 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 882 0 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8 940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
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tcaataactg attatatcag ctggtacat/t gccg-tagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaata-t cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcc-tcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctga-tgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgc*tcgatg cgatgtttcg cttggtgg-fcc gaatgggcag 9 780 - 
gtagccggat caagcgtatg^cagccgccgc attgcatcag ccatgatgga iactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9 900 
tcccttcccg cttcagtgac aacgtcgagc. acagctgcgc aaggaacgcc cgtcgtggcc -9960 
agccacgata gccgcgctgc c-tcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct, gttgt.gccca g-tcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa tuggataccg aggggaattt atggaacgtc 10260 
agtggagca-t ttttgacaag aaatatttgc -tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atgg-tttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
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gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagc-tat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gt-ttctgcgg 10740 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt. actaaattta taacaccttt tatgctaacg 10980 
tt-tgccaaca c-btagcaat.t tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg. tttggagatt taattgttgc aatgctgcat -11160 
ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 114 6 0 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 1152 0 
agccagcgga tccgatatcg ggcccgctag cgttaaccct gctttaatga gatatgcgag 11580 
acgcctatga tcgcatgata tutgctttca attctgttgt gcacgttgta aaaaacctga 11640 
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gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700 

ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 11760 

attcgagctc ggcgcgcctc tagaggatcg atgaattcag atcggctgag tggctccttc 11820 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 11880 

tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 1194 0 

ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 12000 

attagaataa tcggatattt aaaagggcgt gaaaaggttt atccttcgtc catttgtatg 12060 

tgcatgccaa ccacagggtt cccca 12085 

<210> 40 

<211> 12079 - \ 

<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with a 

promoter-terminator expression cassette 

<400> 40 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
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tgacaaagtt. gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg ttt/ttcggcc ggggacgccg tcaatgcgct gatgacaa-tc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggetcgt. tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcg-ttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
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ccgctgcata accctgcttc ggggtcatta -tagcgatttt ttcggtata't ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata gg-tgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cg-tcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2 040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcg-tg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2 2 20 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 22 80 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgt:c1:atg gccctgcaaa 2 4 00 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtgga-ta 2 4 60 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacg-ttgaca cttgaggggc 2 520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggta-ttga cacttgaggg gcgcgactac 2 700 

tgacagatga ggggcgcgat. ccttgacact tgaggggcag agtgctgaca gatgaggggc 2 7 60 



0093/00064 <=* 02485060 2 oo4-io- 2 e 

128 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg ectcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
g-tatgaaaac gagaattgga cctttacaga. attactctat gaagcgccat atttaaaaag 34 2 0 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3 6 00 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3 660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 37 80 
gctgcctcag attcaggtta tgccgctcaa ttcgct-gcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
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atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 402 0 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag tt.tttt.aag1: gacgtaaaat 414 0 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4 260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt- ggggtatctt . taaatactgt agaaaagagg aaggaaataa 4 62 0 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4 800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 
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cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct. tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg act*tactggg gatcaagcct gat.tgggaga aaataaaata ttatatt-t-ta 5400 

ctggatgaat tgttt/tagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtatrtt 5520 

gggcaagggg tcgctggtat -tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga. aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagotacgg ccaagatcga 5880 

gcgcgacagc gtgcaac-tgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt g-ftggagtac gcgaagcgca cccctatcgg 6 3 60 
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cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctgg*t cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggct/t 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aacct-tccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 678 0 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggt.g gaacacgcct gggtcaatga 6840 

tgaccrtggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat. 'ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6 960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggat/taaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattg-tcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 714 0 

cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctg-tcggtct tcaaacagga ggacggcccc 726 0 

aaggacgctc acaaggcgca tctgtccggc gttt/tcgtgg agcccgaaca gcgaggccga 7 32 0 

ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7 38 0 

cgacagattc caacgggaat. ctggtggatg cgcatcttca tcctcggcgc acttaatatt 744 0 

tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 750 0 

acgg-taggcg ctgtgcagcc gctgatggtc g-tgttcatct ctgccgctct gctaggtagc 7560 
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ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggctft ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7 680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag egggtttaac ctacttcctt tggttccggg gga-tctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg ggg-tcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg. ataattcgga tctctgcgag ggagatgata 8220 
-tttgatcaca ggcagcaacg ctctgtcatc g-ttacaatca acatgctacc ctccgcgaga 82 80 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat B340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 84 60 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 858 0 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8 7 00 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8 7 60 



* CA 02485060 2004-10-28 

0093/00064 

133 

gaacgtggac tccaacg-tca aagggcgaaa aaccgtctat cagggcgat'g gcccacrtacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct. gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg cta-ttacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagtfcg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaat/t cagaaatatt. 9240 
tcaataactg attatatcag ctggtacatt. gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagcta ' gcttagctc'a tcgggggatc cgtcgaagct 93 60 
agct*tgggtc ccgctcagaa gaactcgtca- agaaggcgat: agaaggcgat' gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 94 8 0 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 954 0 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 972 0 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcag-tgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
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agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcacbgg'a caggtcggtc 1002 0 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct. gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
c-ttaccagag ggcgccccag ctggcaattc - cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 1074 0 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
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ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 1122 0 
tttgaggtgc atgaacgtca. cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 1152 0 
agccagcaga tctgccggca tcgatcccgg gccatggcct gctttaatga gatatgcgag 115 8 0 
acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta aaaaacctga 11640 
gcatgtgtag ctcagatcct t,accgccggt ttcggttcat tctaatgaat atatcacccg 11700 
ttactatcgt atttttatga. ataatattct: ccgttcaatt tactgattgt ccgtcgacga 11760 
gctcggcgcg cctctagagg atcgatgaat .tcagatcggc tgagtggctc cttcaacgtt 11820 
gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa 11880 
cgtgactccc ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac- 11940 
tatcagtgtt tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga 12000 
ataatcggat atttaaaagg gcgtgaaaag gtttatcctt cgtccatttg tatgtgcatg 12060 
ccaaccacag ggttcccca 1207 9 



<210> 41 
<211> 13002 
<212> DNA 
<213> Unknown 



<220> 

<2 2 3> Plant expression vector with two 
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promoter-terminator expression cassettes 
<400> 41 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 12 0 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 18 0 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 24 0 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 30 0 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 
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ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct. gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 16 20 
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 16 80 
ggcg-tcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc.ggatggct.ga tgaaaccaag ccaaccagga -1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 198 0 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
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gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacg-ttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 258 0 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 264 0 
gatgtggaca agcctgggga taagtgccct. gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggt-tt, 2 82 0 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat. 2880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2 940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc. caaaaatggc agcgctggca gticct/tgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 312 0 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt. gg-tcgcgggt gccgtgctcg 3300 
tg-ttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
gtatgaaaac gagaattgga cctttacaga attactctat. gaagcgccat atttaaaaag 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactga-taag ataatatatc ttttatatag aagatatcgc cg-tatgtaag gatttcaggg 3540 
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ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat. agcttgtaaa ttctatcata 3660 
attggg-taat. gactccaact tattgatagt gttttatgtt. cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt, 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg -tgacagcagg ctcataagac gccccagcgt cgccatag-tg cgttcaccga 39 60 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 42 00 
catatcaatg attttctggt gcgtaccggg. ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatg-t ccggcggtgc ttttgccg-tt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt. ggcagcatca cccataattg 44 40 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat. gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
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aacctatatt, taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 48 00 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 48 60 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat. tgtccctata cgaatagctt agacagccgc 5040 

t-tagccgaat. tggatt-actt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aactt.gt.ctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 54 00 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5 7 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
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ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6 0 00 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6 0 60 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
t-tccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcat*tt:tc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcagg-t gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct. g.tcgcgccta- caggcgacgg: cgatgggctt. 6480 
cacgtccgac cgcgttgggc acctggaatc ggtg-tcgctg ctgcaccgct tccgcgtcct 654 0 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac -6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 67 20 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6 7 80 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6 840 
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgct ttactggcat. ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6 9 60 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7 020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggat/ttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
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cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7 260 
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7 320 
ggggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7 3 80 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt. 7440 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7 500 
acggtaggcg ctgtgcagcc gctgatggtc gtgttcatct ctgccgctct gctagg-tagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgc-tgt/tg 7 620 
g-fcgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
g-tttcca-tgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctxrtgctc 774 0 
acct.tt.accg cctggcaact ggcggccgga ggacttctgc tcgttccagt agct/ttagtg 7800 
t-btgatccgc caatcccgat gcctacagga. accaatgttc tcggcctggc gtggctcggc -7860 
ctgatcggag cgggtttaac ctacttcctt. tggttccggg ggatctcgcg actcgaacct 7 92 0 
acagttg-ttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat. 8040 
aggggag-ttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcc-tcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtca*tc gttacaatca acatgc-tacc ctccgcgaga 8280 
tcatccgtg-t ttcaaacccg gcagcttagt tgccgttct.t ccgaatagca tcggtaacat 8340 
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gagcaaagtc tgccgcct/ta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgct/taga caacttaata acacattgcg 8520 
gacgtttt/ta atgtac-tggg gtggt/ttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccftcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 
gcaggcgaaa atcctgtttg a-tggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cac.tattaaa 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820. 
tgaaccatca cccaaatcaa g-ttttt-tggg- gtcgaggtgc cgtaaagcac -taaatcggaa" 8880 " 
ccctaaaggg : agcccccgat * tt.agagc*tt.g acggggaaag ecggcgaacg tggcgagaaa B940 
ggaagggaag aaagcgaaag, gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc. agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagt.tg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt ta-ttgatgca ag-tttaaatt. cagaaatatt. 9240 
tcaataactg at/tat.atcag ctggtaca-tt gccgtagatg aaagactgag tgegatat/ta 93 00 
tg-tgtaatac a-taaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat. agaaggcgat gcgc-tgcgaa 94 2 0 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 94 80 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
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ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9 6 00 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9 7 80 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc egtcgtggcc 99 60 
agccacgata gccgcgctgc ctcgtcc-tgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa. ttggataccg aggggaattt atggaacgtc -10260 
agtggagcat. ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
gtcatcggcg ggggtcataa cgtgactccc ttaattct.cc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 
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actggctttc tacgtgtt.cc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
ggatggcata tacaccaaac attcaataat tcttgaggat aataatg.gta ccacacaaga 1122 0 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag - gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 1134 0 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac. tagttatgca tgtagtctat ataatgagga -11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 
atcacccgtt actatcgtat ttttatgaat aatattctcc. gttcaattta ctgattgtcc 11760 
gtcgagcaaa tttacacatt gccactaaac gtctaaaccc ttgtaatttg tttttgtttt 1182 0 
actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 11880 
caccttttat gctaacgttt gccaacactt agcaatttgc aagttgatta attgattcta 11940 
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aatta-ttttt gtcttc-taaa tacatatact aatcaactgg aaatgtaaat atttgctaat 12000 
atttctacta taggagaatt aaagtgagtg aatatggtac cacaaggttt ggagatttaa 12060 
ttgttgcaat gctgcatgga tggcatatac accaaacatt caataattct tgaggataat 12120 
aa-tggtacca cacaagattt gaggtgcatg aacgtcacgt ggacaaaagg tttagtaatt 12180 
tttcaagaca acaatgttac cacacacaag ttttgaggtg catgcatgga tgccctgtgg 12240 
aaagtttaaa aatattttgg aaatgatttg catggaagcc atgtgtaaaa ccatgacatc 12300 
cacttggagg atgcaataat gaagaaaact acaaatttac atgcaactag ttatgcatgt 1236 0 
agtctatata atgaggattt tgcaatactt tcattcatac acactcacta agttttacac 1242 0 
gattataa-tt tcttcatagc cagcggatcc gatatcgggc ccgctagcgt taaccctgct 1248 0 
ttaatgagat. atgcgagacg cctatgatcg catgatattt gctttcaatt ctgttgtgca 12540 
cgttgtaaaa aacctgagca tgtgtagctc agatccttac cgccggtttc ggttcattct 12 600 
aatgaatata tcacccgtta ctatcgtatt. tttatgaata atattctccg ttcaatttac 12660 
tgattgtccg tcgacgaatt cgagctcggc gcgcctctag aggatcgatg aattcagatc 12720 
ggctgagtgg ctccttcaac gttgcggttc tgtcagttcc aaacgtaaaa cggcttgtcc 12780 
cgcgtcatcg gcgggggtca taacgtgact cccttaattc tccgctcatg atcagattgt 12840 
cgtttcccgc cttcagttta aactatcagt gtttgacagg atatattggc gggtaaacct 12900 
aagagaaaag agcgtttatt agaataatcg gatatttaaa agggcgtgaa aaggtttatc 12960 
cttcgtccat ttgtatgtgc atgccaacca cagggttccc ca 13002 



<210> 42 
<211> 13905 
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<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with three 

promoter-terminator expression cassettes 

<400> 42 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 6 0 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 2 40 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 3 00 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 3 60 

tgacaaagtt gcagccgaat acagtgatcc . gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 9 00 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 9 60 
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ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggctt/t ttcatgccct. gccctagcgt 114 0 
ccaagcctca cggccgcgct cggcctctct ggcggcctiic tggcgctctt. ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgtt.g ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta. tagcgatttt ttcggtatat ccatcctttt 1620 
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 168 0 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtyt tccttcttca 1740 
ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga I860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
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gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc at.gact.tttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2 64 0 
gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac. aggcagaaaa tccagcattt gcaagggttt 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2 880 
aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 
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gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 342 0 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 372 0 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac. tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag tttt.tt.aagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4 2 60 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4 320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 
aaaagctgt-t ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 45 60 
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cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct; aaggtatata agctggtggg agaaaatgaa 4 740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgcrtatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt actgaataac gatctggccg ~atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgc^c" gagctgtiatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt; ttcccacggc. gacctgggag acagcaacat ctttg-tgaaa 522 0 
gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 52 8 0 
gacattgcct. tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgac-ttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga ggg-tgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
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cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcacCg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
gg tc.«ttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtg* ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct. tt gccaggac ctgggctggt cgatcaatgg 6420 
ccgg t a tt ac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggct* 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact a^cagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagl: tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
tgacctggtg cattgcaaao gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgc* ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
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tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7 020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgt-tcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcg.gtct tcaaacagga ggacggcccc 7260 
aaggacgctc acaaggcgca tc*tgt:ccggc gttt.tcgtgg agcccgaaca gcgaggccga 7320 
gg'ggtcgccg gtatgctgct gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gtttatt-tcg gtctaccgcc tgccgggcgg ggtcgcggcg 7500 
acggtaggcg. ctg-tgcagqc gctga-tggtc gtgttcatct ctgccgctct gctag'gtagc 7560' 
ccgatacgat tgatggcggt- cctgggggct atttgcggaa etgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 774 0 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 792 0 
acag-ttgttt ccttactggg ctttctcagc cccagatc-tg gggtcgatca gccggggatg 7980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 
aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
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cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcga'g ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggc-t 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt. ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gt/ttgcccca 8640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt. tccagtttgg aacaagagtc cactattaaa 876 0 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gt.cgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 894 0 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 900 0 
gatcggtgcg ggcctctt,cg ctattacgcc agctggcgaa agggggatg-b gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt. attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtg-taatac ataaattgat gatatagcta gct/tagctca tcgggggaHc cgtcgaagct 9360 
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agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 954 0 
.ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 96 60 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 
tcccttcccg . ettcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtg^cc '9960 
agccacgata gccgcgctgc. ctcgtcctgc. agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttgtgccca - gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 102 00 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 102 60 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 104 40 
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 
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cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 
gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 
ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 
actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 
gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 
attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 
tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 
tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 
aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga moo 
attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 
ggatggcata tacaccaaac attcaataat. tcttgaggat aataatggta ccacacaaga 11220 
tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 
taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 
tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 
aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 
ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 
agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 
tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 
aaacctgagc atg tgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 



atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 1 



1760 
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gtcgagcaaa tt.tacacat.t gccactaaac gtctaaaccc ttgtaatttg -tttttgtttt 11820 

actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 118 80 

caccttttat gctaacgttt gccaacactt agcaatttgc aagttgatta attgattcta 11940 

. aattattttt gtcttctaaa tacatatact aatcaactgg aaatgtaaat atttgctaat 12000 

atttctacta taggagaatt aaagtgagtg aatatggtac cacaaggttt ggagatttaa 12060 

ttgttgcaat gctgcatgga tggcatatac accaaacatt caataattct tgaggataat 12120 

aatggtacca cacaagattt gaggtgcatg aacgtcacgt ggacaaaagg tttag-taatt 12180 

tttcaagaca acaatgttac cacacacaag ttttgagg-tg catgcatgga tgccctgtgg 12240 

aaagt.tt.aaa aatattttgg aaatgatttg catggaagcc atgtgtaaaa ccatgacatc 12300 

cacttggagg atgcaataat gaagaaaact acaaatttac-'atgcaactag ttatgcatgt 12360 

ag-tctat:ata atgaggattt tgcaatactt. tcattcatac acactcacta agttttacac 12420 

gattataatt, tcttcatagc cagcggatcc gatatcgggc ccgctagcgt taaccctgct 12480 

ttaa-tgagat atgcgagacg cctatgatcg catgatattt gctttcaatt ctgttgtgca 12540 

cgttgtaaaa aacctgagca tgtgtagctc agatccttac cgccggtttc ggttcattct 12 600 

aatgaatata tcacccgtta ctatcgtatt tttatgaata atattctccg ttcaa-tttac 12 6 60 

tgat-tg-tccg tcgagcaaat ttacacattg ccactaaacg tctaaaccct -tgtaa-tttgt: 12 7 20 

ttttgtttta ctatgtgtgt tatgtatttg atttgcgata aatttttata tttggtacta 12780 

aatttataac accttttatg ctaacgtttg ccaacactta gcaatttgca agttgattaa 12 84 0 

ttga-ttctaa at-tatttttg tcttctaaat acatatacta atcaactgga aatgtaaata 12900 

tt-tgctaata tttctactat aggagaatta aagtgagtga atatggtacc acaaggtttg 12960 
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9 a,at ttaat t9ttgoaatg ctgc . t9gat ccoaacat , c ^ ^ 

9 a 99ataata atggtaccac acaa , atttg a5gtacjtga a=gtcac9tg 9a _^ ^ 
ttcaagaoaa caat9tta=c acieaca>9t ^^^^^ ^ 
9 ccc t9t5sa aagtttaaaa atattttgga aatgatttgo ^ 

=at ga „ t =c actt 9gag9a tgcaataatg aaga . aacta caaatttaca tgcaacta9t ij26o 
» t9 c at5ta 9tctatataa tgaggatt „ gcaatacttt ^^^^^ ^ 
9 ««.cac 9 attataattt cttcalag „ agcagatctg cc99catc?a ^ 
*„cc tg c« taatgagata tgcgaga=g= ctatgatcgc atgatattt9 cMtca ^ = ^ 
t 9 t l9t9 c.c 9 « 9taaaaa aootgagcat gtgtagotca gatccttaoc gccs9tt ^ 5 ^ 
g «c a «„ a atgaatatat cacccgttac tatcgtattt ttatgaat . a tattetcegt 1M 

tcaatttact gattgtccgt cgacgagctc ggcgcgcctc 

a ggcgcgcctc tagaggatcg atgaattcag 13620 

a,cggc tgag tggctccttc aacgttgcgg ^ 
tcccgcgtca tcggcgg ggg tcataacgtg actcccttaa ttctccgctc atgatcagat 13740 
^cgtttcc cgccttcagt tta aac tatc agtgtttgac aggatatatt ggcgggtaaa 13800 

cctaagagaa aagagcgttt atta g aataa tcggatattt aaaaacw^ 

vs l " aaaagggcgt gaaaa gg ttt 138 60 

atccttcgtc catttgtatg tgcatgccaa ccacagggtt 



60 



cccca 13905 



<210> 43 
<2ll> 15430 
<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with 



two 
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promoter-terminator expression cassettes, 

inserted is Physcomitrella patens elongase and desaturase 

<220> 
<221> CDS 

<222> (11543) (12415) 

<220> 
<221> CDS 

<222> (13313) (14890) 
<400> 43 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 24 0 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata. ttatgtttat cagtgataaa gtgtcaagca -360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
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ccgttgaaca ggctccgctc -tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt tg-tcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct. gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcgg-tatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 144 0 
aggactataa agataccagg cgtttccccc tggaagctcc ctcg-tgcgct ctcctg-ttcc 1500 
gaccctgccg cttaccggat acctgtccgc. ctttctccct tcgggaagcg tggcgctttt 1560 
ccgctgcata accctgcttc ggggtcatta tagcgafttt ttcggtatat ccatcctttt 1620 
tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 16 80 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccc-tta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cg-tcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
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tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 
tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagc-ttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2 2 80 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2 400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gt.t>gtggata 2 4 60 
cctcgcggaa aact-tggccc tcactgacag a-tgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacaqatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt "tacgcgagtt tcccacagat 264 0 
gatgtggaca agcctgggga -taagtgccct gcgg-tattga cact-tgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact. tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaaggg-ttt. 2820 
ccgcccgttt ttcggccacc gctaacctgt cttttaacct gct/tttaaac caatatttat 2880 
aaaccttgt/t tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2 940 
tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
t/tgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 312 0 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gactitcatgg 3240 
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cggggccggc aatttttacc ttgggcattc tt ggcatagt ggtcgcgggt gccgtgctcg 3300 
tgttcgg ggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 33 60 
g tat ga aaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaa g 3420 
ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 
tactgataag ataatatatc tttfff g aagatatcgc cgtatgtaag gatttcaggg 3540 
ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 
attgggtaat gactccaact fttgafgt gtttt.tgtt cagataatgc ccgatgactt 3720 
tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 
gctgcctcag a tt cagg tta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 
gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 
cgtcaaaggg tgacagcagg ctcataagac. gccccagcgt cgccatagtg cgttcaccga 3960 
atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 
gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 
tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 
cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 
catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 
acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 
agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 
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tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt. taaggtttta gaatgcaagg aacagtgaat tggag-ttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt: agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat, atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4 680 

gtaaaagata cggaaggaat. gtctcctgct aaggtatata agctggtggg agaaaatgaa 4 740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccaccta-tga tgtggaacgg 4 800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4 860 

gaacggcatg a/tggctggag caatctgctc atgagtgagg ccgatggcgt cctiitgc'tcg 4 920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agct.gt/atgc ggagtgcatc 4 980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattact/t actgaa.taac gatctggccg a-tg-tggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc. gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ct/ttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatg-tcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata -t-tatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
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ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcg-t ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacagg-tcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgt.tc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcatt/ttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt. gttggagtac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggcrtt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcg-tcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6 600 
gtttgctggc gaccactaca cgaaattcat. atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aacc-ttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
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tgacctggtg cattgcaaac gctagggcct -tgtggggtca gttccggctg ggggttcagc 6900 
agccagcgc-t ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgt-tcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggl: tgcgagatgc cgtggcattc 7200 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct. tcaaacagga ggacggcccc 7 2 60 
aaggacgctc acaaggcgca tctgtccggc gt.ttt.cgtgg agcccgaaca gcgaggccga 7320 
gggg-tcgccg gtatgctgct ' gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7380 
cgacagattc caacg.ggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gt.ttat.ttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7 500 
acggtaggcg ctgtgcagcc gctgatggtc . gtgttcatct ctgccgctct gctaggtagc 7 560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct gtcggcgtcg cagcgggcct ggcgggggcg 7 680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7 800 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7860 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 79 80 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 804 0 
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aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 
cggctttatc cagcgatttc ct.tt.tgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 834 0 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 
gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 
tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8 640 
gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 
agaatagccc gagatagggt tgagtgttgt. tccagtttgg aacaagagtc cactattaaa 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 
aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 
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tcaataactg attatatcag ct.ggtacatt gccgtagatg aaagactgag tgcgatatta 9300 
tgtgtaatac ataaattgat gatatagota gcttagctca tcgggggatc cgtcgaagct 93 60 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 
tcagcaatat cacgggtagc caacgctatg tcctga-tagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgcct/tgag cctggcgaac 9660 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9 720 
gcttccatcc gagtacgtge tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 
g-tagccggat caagcgt.at.g cagccgccgc attgcatcag ccat,ga*tgga tactttctcg^9840 
gcaggagcaa ggtgagatga caggagatcc tgccceggca cttcgcccaa tagcagccag 9900 
tcccttcccg ct-tcagtgac aacgtcgagc. acagctgcgc aaggaacgcc cgtcgtggcc -9960 
agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 
ccgattgtct gttg-tgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat. gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat. -ftt/tgacaag aaatatttgc tagctgatag tgaccttagg cgacttt-tga 10320 
acgcgcaata atgg-tttctg acgtatgtgc ttagctcatt. aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 
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9tc a tc ggcg ggggtcataa cgtgactcco ttaattctcc gctcatg , tc ttgatcccct 1050o 
9 c g cc atcag at =c ttggcg gcaagaaagc catccagttt , otttgca9g gcttoccaac io56o 
c« a cc agag ggcgccccag ctggcaattc cggttog „ t gclgtccata aaa „ 9ccca 1Q620 
gt «a 9 c tat c g cc atgtaa gcccactg=a agctacctg= tttctctttg cgcttgom io6so 
tteott*. = agatagc « agtagctgac attcat „ gg ggtcageaoc gtttctgcgg 10? 
*~ ggctttc tacgtgttcc gcttccttta gcagcccttg cgc „ tgagt gcttg=ggca 
9 c g t gaag c* tg c a t g=ctg c, ggt e gaog gcgcg cc gag c,cc t = gagc aaattta=ao 10860 

cc« tg „« « gtttttgt « tactatgt gtg « atgta 10920 

« tga « tgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 109eo 
t» g « aaca =ttagcaa « tgcaagttga ttMttw , t ctaaattatt wgtottct 11040 

...t Ktttt a ct aat c aa = tggaaaegta aatalttgct aatatttcta ctataggaga lnoo 

=« aaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgeat tllM 

«.t„e.t. t.c a cc aaa c ««U. tC« gaggat aataatggta ooacacaaga mj| , 

ttt gagglgo atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt UJ 

t.cc a = aca = aagttttgag gtgcatg<:at ggatgccctg tggaaagttt aaaaatattt n3 

t ggaaatg .t ttgcatggaa gccatgtgta aaac=atgae atccacttg9 tmnvMC ll400 

a« gaagaaa actacaaa « ta „ tgcaac tagttatgca tgtagtctat ataatgagga n460 

«« g e aata c« tc « t c a ta c acaotca « aagtttta c a c gattala « ttcttcal 11S20 

""""" " gtc W gag aga tto tac gag 

Met Glu val v a l Glu Ai g P„e Tyt Gly olu 
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ttg gat ggg aag gtc teg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620 
Leu Asp Gly Lys Val Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe 

15 .20 25 

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc etc gtt 116 68 
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val 
30 35 40 

gac agt ccc aca ccc ate gtc etc ggt gtt tct gta tac ttg act att 11716 
Asp Ser Pro Thr Pro lie Val Leu Gly Val Ser Val Tyr Leu Thr lie 
45 50 55 

gtc att gga ggg ctt ttg tgg ata aag gec agg gat ctg aaa ccg cgc 117 64 
Val He Gly Gly Leu Leu Trp He Lys Ala Arg Asp Leu Lys Pro Arg 

60 65 70 _ . -. 

gec teg gag. cca ttt ttg etc caa get ttg gtg ctt gtg cac aac ctg 11812 
Ala Ser Glu Pro Phe Leu Leu Gin Ala Leu Val Leu Val His Asn Leu 
75 80 - 85 - 90 

ttc tgt ttt gcg etc agt ctg tat atg tgc gtg ggc ate get tat cag 11860 
Phe Cys Phe Ala Leu Ser Leu Tyr Met Cys Val Gly He Ala Tyr Gin 

95 100 105 

get att acc tgg egg tac tct etc tgg ggc aat gca tac aat cct aaa 11908 
Ala He Thr Trp Arg Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys 
110 115 120 

cat aaa gag atg gcg att ctg gta tac ttg ttc tac atg tct aag tac 11956 
His Lys Glu Met Ala He Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr 
125 130 135 

gtg gaa ttc atg gat acc gtt ate atg ata ctg aag cgc age acc agg 12004 
Val Glu Phe Met Asp Thr Val He Met lie Leu Lys Arg Ser Thr Arg 
140 145 150 

caa ata age ttc etc cac gtt tat cat cat tct tea att tec etc att 12052 
Gin He Ser Phe Leu His Val Tyr His His Ser Ser He Ser Leu He 
155 160 165 170 
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x z = ::: = :;: = = »• ~ - = = = - - ..... 

175 180 185 

gcg get ctg aac tea gga gtq cat ot f ^ 

Ala Ala Leu Asn ser £ HI HI HI Le m ^ 12148 

y Val Hxs val Leu Met Tyr Ala Tyr Tyr Phe 

195 200 

ttg get gec tgc ctt cga agt age cea aag tta aaa aat aan - 
L eu Ala Ala Cys Leu A,g Ser Ser Pro Lys Leu Lys ^ " 9 m 12196 

205 7 LyS Tyr 



235 



t« t W ,gc a 99 ta= «, .„ e .a tte c. ,t 9 ttc cm « t „ 
- £ «, ar9 ^ Le „ Thr „„ phe sln J - - «. 12244 



" 3 230 



::: z ti z: z ;;: - - - = « : « r r - 

235 ... Asn Ala Pro Tyr Pro 



240 245 

^ 45 250 



caa tgg ctg ate aag att ttg ttc tae t*,- a * 

Gin Trp Leu He Lys u. Jf " 9 **° tC9 ** g Ct * ttfc 12340 

Lys Xle Leu phe Tyr Tyr L ^ l ^ ^ 

260 265 
ctt ttc ggc aat ttt tae gta caa aaa tae ate aaa eec * «. 

- p. Gly A Phe Tyr val Gln Lys Tyr - - - gga 123 sa 

275 280 

aag caa aag gga get aaa act gag tga tctagaagge etcetacttt 
Lys Gin Lys Gly Ala Lys Thr Glu cteetgettt 12435 

285 



290 

9 e gaga=9 « tat9atcg „ tsatatttgc tttcaatt=t ^ 
„ tgagcatg t9tagctca9 atccttacc9 ec99tttcgg ^ 

t««t.to TO « cgtatttt tatgaataat attctc=gtt ^ 
•«^o 9 . 9= „. ttt acacattg „ actaaacgt= taaa=cctt9 
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ttgttttact atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa 12 735 

tttataacac cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt 12 795 

gattctaaat tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt 12 855 

tgctaatatt tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga 12915 

gatttaattg ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga 12975 

ggataataat ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt 13035 

agtaattttt caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc 13095 

cctgtggaaa gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca 13155 

tgacatccac ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta 13215 

tgcatgtagt ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt 13275 

tttacacgat tataatttct tcatagccag cggatcc atg gta ttc gcg ggc ggt 13330 

Met Val Phe Ala Gly Gly 
2 95 

gga ctt cag cag ggc tct etc gaa gaa aac ate gac gtc gag cac att 1337 8 
Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn lie Asp Val Glu His lie 
300 305 310 

gec agt atg tct etc ttc age gac ttc ttc agt tat gtg tct tea act 1342 6 
Ala Ser Met Ser Leu Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr 
315 320 325 

gtt ggt teg tgg age gta cac agt ata caa cct ttg aag cgc ctg acg 13474 
val Gly Ser Trp Ser Val His Ser lie Gin Pro Leu Lys Arg Leu Thr 
330 335 340 345 

agt aag aag cgt gtt teg gaa age get gec gtg caa tgt ata tea get 13522 
Ser Lys Lys Arg Val Ser Glu Ser Ala Ala Val Gin Cys He Ser Ala 
350 355 360 
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gaa gtt cag aga aat teg agt acc cag gga act gcg gag gca etc gea 13570 
Glu val Gin Arg Asn Ser Ser Thr Gin Gly Thr Ala Glu Ala Leu Ala 

370 375 



365 



gaa tea gte gtg aag ce= acg aga ega agg tea tet cag tgg aag aag 13618 
Glu Ser Val Val- Lys Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys 

385 3*° 



380 



teg aea cac cce eta tea gaa gta gea gta eae aae aag eea age gat 13666 
Ser Thr His Pro Leu Ser Glu Val Ala val His Asn Lys Pro Ser Asp 



395 



tge tgg att gtt gta aaa aae aag gtg tat gat gtt tee aat ttt gcg 13714 

C ys Trp lie Val val Lys Asn Lys val Tyr Asp val Ser Asn Phe Ala 

410 

gac gag cat ec* gga gga tea gtt att agt act tat ttt gga ega gae 137 62 
ASP Glu His Pro Gly Gly Ser Val He Ser Thr Tyr Phe Gly Arg Asp 

440 



430 



435 



gge aca gat gtt tte tet agt ttt eat g ea get tet aea tgg aaa att 13810 
Gly Thr Asp Val Phe Ser Ser Phe His Ala Ala Ser Thr Trp Lys Xle 
445 * 50 

ctt eaa gac ttt tac att ggt gae gtg gag agg gtg gag eeg act eea 13858 
L eu Gin Asp Phe Tyr He Gly Asp Val Glu Arg Val Glu Pre Thr Pro 



460 



465 



470 



,„ «, =t, aaa ,« «c c. «9 >■» «<=<= =« «= =t, .,9 9>9 1390. 

S lu Leu Leu Lys Asp p h e Ar, Glu M.t Ar, Ala Leu Phe Leu Ar, Glu 



475 



480 



485 



caa 



... ctt tte aaa agt teg aaa ttg tac tat gtt atg aag ctg etc acg 
Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr 
490 

aat gtt get att ttt get gcg age att gea ata ata tgt tgg age aag 14002 
A sn Val Ala He Phe Ala Ala Ser He Ala He He Cys Trp Ser Lys 
510 515 "0 
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act att tea gcg gtt ttg get tea get tgt atg atg get ctg tgt ttc 14050 

Thr lie Ser Ala Val Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe 

525 530 535 

caa cag tgc gga tgg eta tec cat gat ttt etc cac aat cag gtg ttt 14098 

Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His Asn Gin Val Phe 
540 545 550 

gag aca cgc tgg ctt aat gaa gtt gtc ggg tat gtg ate ggc aac gec 14146 

Glu Thr Arg Trp Leu Asn Glu Val Val Gly Tyr Val lie Gly Asn Ala 
555 560 565 

gtt ctg ggg ttt agt aca ggg tgg tgg aag gag aag cat aac ctt cat 14194 

Val Leu Gly Phe Ser Thr Gly Trp Trp Lys Glu Lys His Asn Leu His 

570 f . 575 _ 5.80 ... 585 

cat get get cca aat gaa tgc gat cag act tac caa cca att gat gaa 14242 

His Ala Ala Pro Asn Glu Cys Asp Gin Thr Tyr Gin Pro lie Asp Glu 

. 590 I! 595 "- 600: ' 

gat att gat act etc ccc etc att-gee tgg age aag gac ata ctg gec 14290 

Asp lie Asp Thr Leu Pro Leu lie Ala Trp Ser Lys Asp lie Leu Ala 

605 610 615 



aca gtt gag aat aag aca 
Thr Val Glu Asn Lys Thr 
620 

ttc ttc atg ggt ctg tta 
Phe Phe Met Gly Leu Leu 
635 

age tgg aga tat ace tct 
Ser Trp Arg Tyr Thr Ser 
650 655 



ttc ttg cga ate etc caa 

Phe Leu Arg lie Leu Gin 
625 

ttt ttc gee cgt ggt agt 

Phe Phe Ala Arg Gly Ser 

640 645 

aca gca gtg etc tea cct 

Thr Ala Val Leu Ser Pro 
660 



tac cag cat ctg 14338 

Tyr Gin His Leu 

630 

tgg etc ttt tgg 14386 
Trp Leu Phe Trp 

gtc gac agg ttg 14434 
Val Asp Arg Leu 
665 



ttg gag aag gga act gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca 
Leu Glu Lys Gly Thr Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr 
670 675 680 



14482 
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gcg tgc tat ctt etc cct ggt tgg aag cca tta gta tgg atg gcg gtg 145 3 0 
Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val 
685 690 695 

act gag etc atg tec ggc atg ctg ctg ggc ttt gta ttt gta ctt age 145 78 
Thr Glu Leu Met Ser Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser 
700 705 710 

cac aat ggg atg gag gtt tat aat teg tct aaa gaa ttc gtg agt gca 1462 6 
His Asm Gly Met Glu Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala 
715 720 725 

cag ate gta tec aca egg gat ate aaa gga aac ata ttc aac gac tgg 1467 4 
Gin lie Val Ser Thr Arg Asp He Lys Gly Asn He Phe Asn Asp Trp 
730 735 740 745 

ttc act ggt ggc ctt aac agg caa ata gag cat cat ctt ttc cca aca 14722 
Phe Thr Gly Gly Leu Asn Arg Gin He Glu His His Leu Phe Pro Thr 
750 755 760 

atg ccc agg cat aat tta aac aaa ata gca cct aga gtg gag gtg ttc 14 7 70 
Met Pro Arg His Asn Leu Asn Lys lie Ala Pro Arg Val Glu Val Phe 
765 770 775 

tgt aag aaa cac ggt ctg gtg tac gaa gac gta tct att get ace ggc 14 818 
Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser lie Ala Thr Gly 
780 785 790 

act tgc aag gtt ttg aaa gca ttg aag gaa gtc gcg gag get gcg gca 14866 
Thr Cys Lys Val Leu Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala 
795 800 805 

gag cag cat get ace acc agt taa getagegtta accctgcttt aatgagatat 14920 
Glu Gin His Ala Thr Thr Ser 
810 815 

gcgagacgcc tatgatcgea tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa 14980 



cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaa-tatatc 15040 
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acccgttact atcgtatttt tatgaataat attctccgtt caatttactg attgtccgtc 15100 

gacgaattcg agctcggcgc gcctctagag gatcgatgaa ttcagatcgg ctgagtggct 15160 

ccttcaacgt tgcggttctg tcagttccaa acgtaaaacg gcttgtcccg cgtcatcggc 15220 

gggggtcata acgtgactcc cttaattctc cgctcatgat cagattgtcg tttcccgcct 15280 

tcagtttaaa ctatcagtgt ttgacaggat atattggcgg gtaaacctaa gagaaaagag 15340 

cgt.tt.a-tt.ag aataatcgga tatttaaaag ggcgtgaaaa ggtttatcct tcgtccattt 154 00 
gtatgtgcat gccaaccaca gggttcccca 15430 



<210> 44 
<211> 290 
<212> PRT 
<213> Unknown 

<400> 44 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
1 5 10 15 

Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 40 45 

Val Leu Gly Val Ser Val Tyr Leu Thr lie Val lie Gly Gly Leu Leu 
50 55 60 

Trp lie Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 

Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 

85 90 95 



0093/00064 



CA 02485060 2004-10-28 



176 

Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala He Thr Trp Arg Tyr 
100 105 110 

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala He 
115 120 125 

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

Val He Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 
14 5 150 155 160 

Val Tyr His His Ser Ser He Ser Leu He Trp Trp Ala He Ala His 
165 170 175 

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 

Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 230 235 240 

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu He Lys He 
245 250 255 

Leu Phe Tyr Tyr Met He Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 

Val Gin Lys Tyr He Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 



Thr Glu 
290 
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<210> 45 
<211> 525 
<212> PRT 
<213> Unknown 

<400> 45 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
15 10 15 

lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

Se'r Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser lie Gin 
35 40 45 

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

Val Gin cys lie Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 7 5 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro. Thr Arg Arg Arg 

85 90 95 

Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

His Asn Lys Pro Ser Asp Cys Trp lie Val Val Lys Asn Lys Val Tyr 
115 120 125 

Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val lie Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 

Ala Ser Thr Trp Lys lie Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 



Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
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180 



185 



1*9 0 



Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 2 °0 205 

Val Met Lys Leu Leu Thr Asn Val Ala lie p he Ala Ala Ser Jle ^ 

215 220 

lie Xle Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 



230 



235 



240 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 



245 



250 



255 



Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 

Tyr val H e Gly Asn Ala Val Lfiu cly phe ^ ^ ^ ^ ^ ^ 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 2 95 300 

Tyr Gin Pro He Asp Glu Asp n e Asp Thr Leu Pro Leu He Ala Trp 

310 -,,e 

315 320 

Ser Lys Asp He Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg lie 

325 33 ° 335 

Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 



340 345 



350 



Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 3 «0 365 

Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 37 5 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 



390 



395 



400 
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Leu Val Trp Met. Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 

Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 

Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
450 455 460 

His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys He Ala 
465 470 475 480 

Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 

Val Ser He Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 

Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 . 525 



<210> 46 
<211> 17752 
<212> DNA 
<213> Unknown 

<220> 

<223> Plant expression vector with 3 

promoter-terminator expression cassettes, 

inserted with Physcomitrella elongase + desaturase 

+ Phaeodactylum desaturase 

<220> 
<221> CDS 

<222> (11543) (12415) 
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<220> 
<221> CDS 

<222> (13313). .(14890) 

<220> 
<221> CDS 

<222> (15791).. (17200) 
<400> 46 



gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcg« cgag.gaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaa^ac gatcaggggt 240 
atgttgggtt t ca= gtCt gg cctceggacc agcc.ccgct ggtccgatfg aacgcgcgga 300 
ttcttt atca ctgataag^ ggtggacata. ttatgtttat cagtgataaa gtgtcaagca 360 
.gacaaagtt gcagccgaat acagtgatcc g t gccgcc= t ggacc t g tt g aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg" cggaacggtt ggggg tt cag cagccggcgc 4 80 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgc.gg 540 
cggag aa t ca tacgcattc, gt gc= g a g a g ccgacgacga ctggcgctca «*etg«te. 600 
ggaatgcccg cagcttcagg caggcgC.c tcgcctaccg cga.ggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg 



«t«cggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 



ctgttggggc cgtgc tt gag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc t cgccgctgt tgcgggccgc ga.agacgcc ttcgacgaag 900 
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ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 
ggaggctcgt. tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 
tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 
ccaccgcgtc agacgcccgt agcagcccgc tacgggctt/t ttcatgccct gccctagcgt 1140 
ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 
gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 
ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 
aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgttt.t tccataggct 1380 
ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 
aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 
gaccctgccg cttaccggat. acctgtccgc- ctttctccct tcgggaagcg tggcgctttt; 1560 
ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 
tcgcacgata tacaggattt. tgccaaaggg ttcgtgtaga ct-ttccttgg tgtatccaac 1680 
ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 
ctgtccctta -ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 
ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 
aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 
aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 
tgggccgcc-t gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcgg-t 2100 
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tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 
gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc .tgaettttt tagccgctaa 2220 
aacggccggg ggg t g=gcg t gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 
gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2 340 
gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccg t c*a t g gccctgcaaa 2400 
cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 
cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 
cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgatt* tacgcgagtt tcccacagat 2640 
gatgtggaca agcctgggga taagtgccct; gcggtattga cacttgaggg gcgcgactac 2700 
tgacagatga ggggcgcgat ccttgacact . tgaggggcag agtgctgaca gatgaggggc 2760 
gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2 820 
ccgcccgtt* ttcggccacc gctaacctgt" cttttaacct gcttttaaac caatatttat 2880 
aaaccttgtt tttaaccagg gctgcgccct. gtgcgcgtga ccgcgcacgc cgaagggggg 2940 
tgccccccc* tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 
tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 
ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 
gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 
cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcggg* gccgtgctcg 3300 
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tgttcggggg tgcgataaac ccagcgaacc 
gtatgaaaac gagaattgga cctttacaga 
ctaccaagac gaagaggatg aagaggatga 
tactgataag ataatatatc ttttatatag 
ggcaaggcat aggcagcgcg cttatcaata 
tgcatggact aat.gctt.gaa acccaggaca 
at'tgggtaat gactccaact. tattgatagt 
tgtcatgcag ctccaccgat tttgagaacg 
gctgcctcag attcaggtta tgccgctcaa 
gcagctttcc cttcaggcgg gattcataca. 
cgtcaaaggg tgacagcagg ctcataagac 
atacgtgcgc aacaaccgtc ttccggagac 
gcgatttagc cccgacatag ccccactgtt 
tgcccggctg -tatgcgcgag gttaccgact 
cgtgttgagg ccaacgccca taatgcgggc 
catatcaatg attttctggt. gcgtaccggg 
ccatgtttta cggcagtgag agcagagata 
acgcaccacc ccgtcagtag ctgaacagga 
agcacctcaa aaacaccatc atacactaaa 
tggtttcaaa atcggctccg tcgatactat. 



183 

atttgaggtg ataggtaaga ttataccgag 3360 

attactctat gaagcgccat atttaaaaag 3420 

ggaggcagat tgccttgaat atattgacaa 3480 

aagatatcgc cgtatgtaag gatttcaggg 3540 

tatctataga atgggcaaag cataaaaact 3600 

ataaccttat agcttg-taaa ttctatcata 3660 

gtttta-tgtt cagataatgc ccgatgac-tt 3720 

acagcgactt ccgtcccagc cgtgccaggt 3780 

ttcgctgcgt. atatcgcttg ctgattacgt 3 84 0 

gcggccagcc atccgtcatc catatcacca 3900 

gccccagcgt cgccatag-tg cgttcaccga 3960 

tgtcatacgc gtaaaacagc cagcgctggc 4 02 0 

cgtccatttc cgcgcagacg atgacgtcac 4080 

gcggcctgag tttt.ttaagt gacgtaaaat 414 0 

tgttgcccgg catccaacgc cattcatggc 4200 

ttgagaagcg gtgtaagtga actgcagttg 42 6 0 

gcgctgatgt. ccggcggtgc ttttgccgtt 432 0 

gggacagct-g atagacacag aagccactgg 4380 
tcagtaagt/t ggcagcatca cccataattg 4 440 

gttatacgcc aactttgaaa acaactttga 4500 



0093/00064 CA 02485060 2004-10-28 

184 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 
cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 
taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 
gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 
aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 
gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcg* cctttgctcg 4920 
gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 
aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
ttagccgaat tggattactt- actgaataac gatctggccg atgtggattg cgaaaactgg 5100 
gaagaagaca ctccatttaa agatccgcgc. gagctgtatg attt.tttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 
gatggcaaag taagtggctt tattgatct* gggagaagcg gcagggcgga caagtggtat 5280 
gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 
ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 
caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 
gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 
cggccagacg gtc t acgg ga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 
ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 
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cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 
cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 
gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 
gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 
ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 
aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 
cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 
ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 
ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 
ggtcattttc cacgtcaaca aggacgtgaa- gatcacctac :accggcgtcg agctgcgggc 6300 
cgacgatgac gaactggtgt ggcagcaggt. gtt.ggagt.ac gcgaagcgca cccctatcgg 6360 
cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 
ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 
cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 
ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 
gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 
ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 
aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 
cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 
tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 
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agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 
ttgacaattg tgattaaggc tcagat/tcga cggcttggag cggccgacgt gcaggatttc 7080 
cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 
cacgaggaga aaaagcccat ggaggcgttc gctgaacggt tgcgagatgc cgtggcattc 7 2 00 
ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 
aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 
ggggtcgccg gtatgctgct. gcgggcgttg ccggcgggtt tattgctcgt gatgatcgtc 7 380 
cgacagattc caacgggaat ctggtggatg cgcatcttca tcctcggcgc acttaatatt 7440 
tcgctattct ggagcttgtt gtttatttcg gtctaccgcc tgccgggcgg ggtcgcggcg 7 500 
acggtaggcg ctgtgcagcc gctgatggtc. gtgttcatct ctgccgctct gctaggtagc 7560 
ccgatacgat tgatggcggt cctgggggct atttgcggaa ctgcgggcgt ggcgctgttg 7620 
gtgttgacac caaacgcagc gctagatcct. gtcggcgtcg cagcgggcct ggcgggggcg 7680 
gtttccatgg cgttcggaac cgtgctgacc cgcaagtggc aacctcccgt gcctctgctc 7740 
acctttaccg cctggcaact ggcggccgga ggacttctgc tcgttccagt agctttagtg 7 8 00 
tttgatccgc caatcccgat gcctacagga accaatgttc tcggcctggc gtggctcggc 7 8 60 
ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 
acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7 980 
catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8 040 
aggggagttg atatcgtcaa cgt.tcact.tc taaagaaata gcgccactca gcttcctcag 8100 
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cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 
cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 
tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 
tcatccg-tg-t ttcaaacccg gcagcttagt tgccg-ttctt ccgaatagca tcggtaacat 8 34 0 
gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 
gcctgtatcg agtggtgat/t ttgtgccgag ctgccggtcg gggagctgtrt ggctggctgg 84 60 
tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 85 20 
gacgttttta atgtactggg gtggt/ttttc ttttcaccag tgagacgggc aacagctgat. 8580 
tgcccttcac cgcctggccc tgagagagttr -gcagcaagcg gtccacgc-tg gtttgcccca 8640 
gcaggcgaaa. a-bcctgtttg atggtggttc 'cgaaatcggc aaaatccctt^ ataaatcaaa" 8700 
agaatagccc gagatagggt. tgagtgttgt tccact.ttgg aacaagagtc cactattaaa' r 8760 
gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 
tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 
ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 
ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 
gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 
gattaagttg ggtaacgcca gggtt-ttccc agtcacgacg ttg-taaaacg acggccagtg 9120 
aa-ttaat-tcc catct/tgaaa gaaatatagt ttaaatatt/t attgataaaa taacaagtca 9180 
ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt. cagaaa-tatt. 9240 
tcaataactg attatatcag ctggtaca-tt gccgtagatg aaagactgag tgcgatatta 9300 
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tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 93 6 0 
agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 
tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 94 8 0 
tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 
ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 96 60 
agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 972 0 
gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 97 8 0 
gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 
gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 99 00 
tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 99 6 0 
agccacgata gccgcgcrtgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 
ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 100 BO 
ccgattgtct gttgtgccca gtcatagccg aatagcctcrt ccacccaagc ggccggagaa 10140 
cctgcgtgca atccatct/tg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 
tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 
agtggagcat ttttgacaag aaatatttgc tagctgatag tgacct/tagg cgacttttga 1032 0 
acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 
tgagtggctc cttcaacgtt gcggttctg-t cagttccaaa cgtaaaaegg cttgtcccgc 104 4 0 
gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 
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gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatt-tgct-raatatttc-ta ctataggaga 11100 

attaaagtga gtgaatatgg -taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 1122 0 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtgga aa atg gag gtc gtg gag aga ttc tac ggt gag 11572 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu 
1 5 10 

ttg gat ggg aag gtc teg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620 
Leu Asp Gly Lys Val Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe 
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15 20 '25 

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc etc gtt 11668 
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val 
30 35 40 

gac agt ccc aca ccc ate gtc etc ggt gtt tct gta tac ttg act att 11716 
Asp Ser Pro Thr Pro lie Val Leu Gly Val Ser Val Tyr Leu Thr He 
45 50 55 

gtc att gga ggg ctt ttg tgg ata aag gec agg gat ctg aaa ccg cgc 11764 
Val He Gly Gly Leu Leu Trp He Lys Ala Arg Asp Leu Lys Pro Arg 
60 65 70 

gec teg gag cca ttt ttg etc caa get ttg gtg ctt gtg cac aac ctg 11812 
Ala Ser Glu Pro Phe Leu Leu Gin Ala Leu Val Leu Val His Asn Leu 
75 80 85 90 

ttc tgt ttt gcg etc agt ctg tat atg tgc gtg ggc ate get tat cag 11860 
Phe Cys Phe Ala Leu Ser Leu Tyr Met Cys Val Gly lie Ala Tyr Gin 

95 100 105 

get att acc tgg egg tac tct etc tgg ggc aat gca tac aat cct aaa 11908 
Ala He Thr Trp Arg Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys 
110 115 120 

cat aaa gag atg gcg att ctg gta tac ttg ttc tac atg tct aag tac 11956 
His Lys Glu Met Ala lie Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr 
125 130 135 

gtg gaa ttc atg gat acc gtt ate atg ata ctg aag cgc age acc agg 12 004 
Val Glu Phe Met Asp Thr Val He Met lie Leu Lys Arg Ser Thr Arg 
140 145 150 

caa ata age ttc etc cac gtt tat cat cat tct tea att tec etc att 12052 
Gin He Ser Phe Leu His Val Tyr His His Ser Ser lie Ser Leu He 
155 160 165 170 

tgg tgg get att get cat cac get cct ggc ggt gaa gca tat tgg tct 12100 
Trp Trp Ala He Ala His His Ala Pro Gly Gly Glu Ala Tyr Trp Ser 
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175 180 '185 

gcg get ctg aac tea gga gtg cat gtt etc atg tat gcg tat tac ttc 1214 8 
Ala Ala Leu Asn Ser Gly Val His Val Leu Met Tyr Ala Tyr Tyr Phe 
190 195 200 

ttg get gee tgc ctt cga agt age cca aag tta aaa aat aag tac ctt 12196 
Leu Ala Ala Cys Leu Arg Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu . 
205 210 215 

ttt tgg ggc agg tac ttg aca caa ttc caa atg -ttc cag ttt atg ctg 1224 4 
Phe Trp Gly Arg Tyr Leu Thr Gin Phe Gin Met Phe Gin Phe Met Leu 
' 220 225 230 

aac tta gtg cag get tac -tac gac atg aaa acg aat gcg cca tat cca 12292 
Asn Leu val Gin Ala Tyr Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro 

235 ~ ~ 240 245 " * 250 *" 

caa tgg ctg ate" aag att ttig- ttc ta^ tac atg ate teg ttg ctg ttt 12340 
Gin Trp Leu lie Lys lie Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe 

• - - - •: 255 - " --.v™-. " 260 - - 265 " : 

ctt ttc ggc aat ttt tac gta caa aaa tac ate aaa ccc tct gac gga 12388 
Leu Phe Gly Asn Phe Tyr Val Gin Lys Tyr He Lys Pro Ser Asp Gly 
270 275 280 

aag caa aag gga get aaa act gag tga tctagaaggc ctcctgcttt 124 35 

Lys Gin Lys Gly Ala Lys Thr Glu 
285 290 

aatgagatat gcgagacgcc tatgatcgea tgatatttgc tttcaattct gttgtgcacg 12495 

ttgtaaaaaa cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa 12555 

tgaatatatc acccgttact ategtatttt tatgaataat attctcegtt caatttactg 12615 

attgtccgtc gagcaaattt acacattgcc actaaaegtc taaacccttg taatttgttt 12675 



ttgttttact atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa 12735 
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tttataacac cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt 12795 

gattctaaat tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt 12855 

tgctaatatt tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga 12915 

gatttaattg ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga 12975 

ggataataat ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt 13035 

agtaattttt caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc 13095 

cctgtggaaa gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca 13155 

tgacatccac ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta 13215 

tgcatgtagt ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt 13275 

tttacacgat tataatttct tcatagccag cggatcc atg gta ttc gcg ggc ggt 13330 

Met val Phe Ala Gly Gly 
295 

gga ctt cag cag ggc tct etc gaa gaa aac ate gac gtc gag cac att 1337 8 
Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn He Asp Val Glu His lie 
300 305 310 

gec agt atg tct etc ttc age gac ttc ttc agt tat gtg tct tea act 13426 
Ala Ser Met Ser Leu Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr 
315 320 325 

gtt ggt teg tgg age gta cac agt ata caa cct ttg aag cgc ctg acg 13474 
val Gly Ser Trp Ser Val His Ser He Gin Pro Leu Lys Arg Leu Thr 
330 335 340 345 

agt aag aag cgt gtt teg gaa age get gee gtg caa tgt ata tea get 13522 
Ser Lys Lys Arg Val Ser Glu Ser Ala Ala Val Gin Cys He Ser Ala 
350 355 360 

gaa gtt cag aga aat teg agt acc cag gga act gcg gag gca etc gca 13570 
Glu Val Gin Arg Asn Ser Ser Thr Gin Gly Thr Ala Glu Ala Leu Ala 
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365 370 375 

gaa tea gtc gtg aag ccc acg aga cga agg tea tct cag tgg aag aag 13618 
Glu Ser Val Val Lys Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys 
380 385 390 

teg aca cac ccc eta tea gaa gta gca gta cac aac aag cca age gat 13666 
Ser Thr His Pro Leu Ser Glu Val Ala Val His Asn Lys Pro Ser Asp 
395 400 405 

tgc tgg att gtt gta aaa aac aag gtg tat gat gtt tec aat ttt gcg 13714 
Cys Trp lie Val Val Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala 
410 415 420 . 425 

gac gag cat ccc gga gga tea gtt att agt act tat ttt gga cga gac 13762 
Asp Glu His Pro Gly Gly Ser Val lie Ser Thr Tyr Phe Gly. Arg Asp 
430 435 * 440 

ggc aca gat- gtt ttc tct agt ; ttt. cat gca get tct aca tgg-aaa att 13 810 
Gly Thr Asp Val Phe Ser Ser Phe His Ala Ala Ser Thr Trp Lys lie 

445- 450 --- . 455 ■ V . 

ctt caa gac ttt tac att ggt gac gtg gag agg gtg gag ccg act cca 1385 8 
Leu Gin Asp Phe Tyr lie Gly Asp Val Glu Arg Val Glu Pro Thr Pro 
460 . 465 470 

gag ctg ctg aaa gat ttc cga gaa atg aga get ctt ttc ctg agg gag 13906 
Glu Leu Leu Lys Asp Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu 
475 480 485 

caa ctt ttc aaa agt teg aaa ttg tac tat gtt atg aag ctg etc acg 13954 
Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr 
490 495 500 505 

aat gtt get att ttt get gcg age att gca ata ata tgt tgg age aag 14002 
Asn Val Ala lie Phe Ala Ala Ser lie Ala Tie lie Cys Trp Ser Lys 
510 515 520 



act att tea gcg gtt ttg get tea get tgt atg atg get ctg tgt ttc 
Thr lie Ser Ala Val Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe 



14050 
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525 530 53S 

caa cag tgc gga tgg eta tec cat gat ttt etc cac aat cag gtg ttt 14098 
Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His Asn Gin Val Phe 
540 545 550 

gag aca cgc tgg ctt aat gaa gtt gtc ggg tat gtg ate ggc aac gee l 4 i 46 
Glu Thr Arg Trp Leu Asn Glu Val Val Gly Tyr Val lie Gly Asn Ala 
555 560 565 

gtt ctg ggg ttt agt aca ggg tgg tgg aag gag aag cat aac ctt cat 14194 
Val Leu Gly Phe Ser Thr Gly Trp Trp Lys Glu Lys His Asn Leu His 
570 575 580 585 

cat get get cca aat gaa tgc gat cag act tac caa cca att gat gaa 14242 
His Ala Ala Pro Asn Glu Cys Asp Gin Thr Tyr Gin Pro He Asp Glu 
590 595 goo 

gat att gat act etc ccc. etc att gec tgg age aag gac ata ctg gee 14290 
Asp He Asp Thr Leu Pro Leu He Ala Trp Ser Lys Asp He Leu Ala 
605 .610 615 

aca gtt gag aat aag aca ttc ttg cga ate etc caa tac cag cat ctg 14 338 
Thr val Glu Asn Lys Thr Phe Leu Arg He Leu Gin Tyr Gin His Leu 
620 625 630 

ttc ttc atg ggt ctg tta ttt ttc gee cgt ggt agt tgg etc ttt tgg 14386 
Phe Phe Met Gly Leu Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp 
fi 35 640 645 



age tgg aga tat acc tct aca gca gtg etc tea cct gtc gac agg ttg 14434 
Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu 

665 



650 655 660 



ttg gag aag gga act gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca 14482 
Leu Glu Lys Gly Thr Val Leu Phe His Tyr Phe Trp Phe Val Gly Thr 
670 675 680 

gcg tgc tat ctt etc cct ggt tgg aag cca tta gta tgg atg gcg gtg 14 530 
Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val 
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685 690 695 

act gag etc atg tec ggc atg ctg ctg ggc ttt gta ttt gta ctt age 14578 
Thr Glu Leu Met Ser Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser 
700 705 710 

cac aat ggg atg gag gtt tat aat teg tct aaa gaa ttc gtg agt gca 14 626 
His Asn Gly Met Glu Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala 
715 720 725 

cag ate gta tec aca egg gat ate aaa gga aac ata ttc aac gac tgg 14674 
Gin lie Val Ser Thr Arg Asp lie Lys Gly Asn lie Phe Asn Asp Trp 
730 735 740 . 745 

ttc act ggt ggc ctt aac agg caa ata gag cat cat ctt ttc cca aca 14722 
Phe Thr Gly Gly Leu Asn Arg Gin lie Glu His His Leu Phe Pro Thr 
750 755 760 

atg ccc -agg cat aat tta aac aaa" ata : gca cct aga gtg gag gtg v ttc 14770 
Met Pro Arg His Asn Leu Asn Lys lie Ala Pro Arg Val Glu Val Phe 
765 770 775 

tgt aag aaa cac ggt ctg gtg tac gaa gac gta tct att get acc ggc 14818 
Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser lie Ala Thr Gly 
780 785 790 

act tgc aag gtt ttg aaa gca ttg aag gaa gtc gcg gag get gcg gca 14866 
Thr Cys Lys Val Leu Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala 
795 800 805 

gag cag cat get acc acc agt taa getagegtta accctgcttt aatgagatat 14 920 
Glu Gin His Ala Thr Thr Ser 
810 815 

gcgagacgcc tatgatcgea tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa 14 980 
cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaatatatc 15040 



acccgttact ategtatttt tatgaataat attctcegtt caatttactg attgtccgtc 15100 
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gagcaaattt acacattgcc actaaacgtc taaacccttg taatt'tigttt ttgttttact 15160 

atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa tttataacac 15220 

cttttatgct aacgtttgcc aacacttagc aatttgcaag ttgattaatt gattctaaat 15280 

tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt tgctaatatt 15340 

tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga gatttaattg 15400 

ttgcaatgct gcatggatgg catatacacc aaacattcaa taattcttga ggataataat 15460 

ggtaccacac aagatttgag gtgcatgaac gtcacgtgga caaaaggttt agtaattttt 15520 

caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc cctgtggaaa 15580 

gtttaaaaat attttggaaa tgatttgcat ggaagccatg tgtaaaacca tgacatccac 15640 

ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta tgcatgtagt 15700 

ctatataatg aggattttgc aatactttca . ttcatacaca ctcactaagt tttacacgat 15760 

tataatttct tcatagccag cagatctaaa atg get ccg gat gcg gat aag ctt 15814 

Met Ala Pro Asp Ala Asp Lys Leu - 
820 825 

cga caa cgc cag acg act gcg gta gcg aag cac aat get get ace ata 15862 
Arg Gin Arg Gin Thr Thr Ala Val Ala Lys His Asn Ala Ala Thr lie 
830 835 840 

teg acg cag gaa cgc ctt tgc agt ctg tct teg etc aaa ggc gaa gaa 15910 
Ser Thr Gin Glu Arg Leu Cys Ser Leu Ser Ser Leu Lys Gly Glu Glu 
845 850 855 

gtc tgc ate gac gga ate ate tat gac etc caa tea ttc gat cat ccc 15958 
Val Cys lie Asp Gly lie He Tyr Asp Leu Gin Ser Phe Asp His Pro 
860 865 870 

999 99t gaa acg ate aaa atg ttt ggt ggc aac gat gtc act gta cag 16006 
Gly Gly Glu Thr He Lys Met Phe Gly Gly Asn Asp Val Thr Val Gin 
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875 880 885 

tac aag atg att cac ccg tac cat acc gag aag cat ttg gaa aag atg 16054 
Tyr Lys Met lie His Pro Tyr His Thr Glu Lys His Leu Glu Lys Met 
890 895 900 905 

aag cgt gtc ggc aag gtg acg gat ttc gtc tgc gag tac aag ttc gat 16102 
Lys Arg Val Gly Lys Val Thr Asp Phe Val Cys Glu Tyr Lys Phe Asp 
910 915 920 

acc gaa ttt gaa cgc gaa ate aaa cga gaa gtc ttc aag att gtg cga 16150 
Thr Glu Phe Glu Arg Glu lie Lys Arg Glu Val Phe Lys lie Val Arg 
925 930 .935 

cga ggc aag gat ttc ggt act ttg gga tgg ttc ttc cgt gcg ttt tgc 16198 
Arg Gly Lys Asp Phe Gly Thr Leu Gly Trp Phe Phe Arg Ala Phe Cys 
940 945 950 

tac att gec att ttc ttc tac ctg cag tac cat -tgg gtc acc acg gga- 16246 
Tyr lie Ala lie Phe Phe Tyr Leu Gin Tyr His "Trp Val Thr Thr Gly 
955 960 ' 965 

acc tct tgg ctg ctg gec gtg gec tac gga ate tec caa gcg atg att 16294 
Thr Ser Trp Leu Leu Ala Val Ala Tyr Gly lie Ser Gin Ala Met lie 
970 975 980 985 

ggc atg aat gtc cag cac gat gec aac cac ggg gec acc tec aag cgt 16342 
Gly Met Asn Val Gin His Asp Ala Asn His Gly Ala Thr Ser Lys Arg 
990 995 1000 

ccc tgg gtc aac gac atg eta ggc etc ggt gcg gat ttt att ggt ggt 16390 
Pro Trp Val Asn Asp Met Leu Gly Leu Gly Ala Asp Phe lie Gly Gly 
1005 1010 1015 

tec aag tgg etc tgg cag gaa caa cac tgg acc cac cac get tac acc 16438 
Ser Lys Trp Leu Trp Gin Glu Gin His Trp Thr His His Ala Tyr Thr 
1020 1025 1030 



aat cac gee gag atg gat ccc gat age ttt ggt gee gaa cca atg etc 
Asn His Ala Glu Met Asp Pro Asp Ser Phe Gly Ala Glu Pro Met Leu 



16486 
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1035 



1040 



1045 



eta ttc aac gac tat ccc ttg gat cat ccc get egt acc tgg eta cat 
Leu Phe Asn Asp Tyr Pro Leu Asp His Pro Ala Arg Thr Trp Leu His 
1050 1055 1060 1065 



16534 



cgc ttt caa gca ttc ttt tac atg ccc gtc ttg get gga tac tgg ttg 
Arg Phe Gin Ala Phe Phe Tyr Met Pro Val Leu Ala Gly Tyr Trp Leu 
1070 1075 1080 



16582 



tec get gtc ttc aat cca caa att ctt gac etc cag caa cgc ggc gca 16630 
Ser Ala Val Phe Asn Pro Gin lie Leu Asp Leu Gin Gin Arg Gly Ala 
1085 1090 1095 



ctt tec gtc ggt ate egt etc gac aac get ttc att cac teg cga cgc 
Leu Ser Val Gly He Arg Leu Asp Asn Ala Phe lie His Ser Arg Arg 
1100 1105 1110 



16678 



aag tat gcg gtt ttc tgg egg get gtg tac att gcg gtg aac gtg att 
Lys Tyr Ala Val Phe Trp Arg Ala Val Tyr He Ala Val Asn Val He 
1115 1120 1125 



16726 



get ccg ttt tac aca aac tec ggc etc gaa tgg tec tgg egt gtc ttt 
Ala Pro Phe Tyr Thr Asn Ser Gly Leu Glu Trp Ser Trp Arg Val Phe 
1130 1135 H40 H45 



16774 



gga aac ate atg etc atg ggt gtg gcg gaa teg etc gcg ctg gcg gtc 
Gly Asn lie Met Leu Met Gly Val Ala Glu Ser Leu Ala Leu Ala Val 
1150 1155 1160 



16822 



ctg ttt teg ttg teg cac aat ttc gaa tec gcg gat cgc gat ccg acc 
Leu Phe Ser Leu Ser His Asn Phe Glu Ser Ala Asp Arg Asp Pro Thr 
1165 1170 1175 



16870 



gec cca ctg aaa aag acg gga gaa cca gtc gac tgg ttc aag aca cag 
Ala Pro Leu Lys Lys Thr Gly Glu Pro Val Asp Trp Phe Lys Thr Gin 
1180 1185 H90 



16918 



gtc gaa act tec tgc act tac ggt gga ttc ctt tec ggt tgc ttc acg 
Val Glu Thr Ser Cys Thr Tyr Gly Gly Phe Leu Ser Gly Cys Phe Thr 



16966 
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1195 1200 1205 ' 

gga ggt etc aac ttt cag gtt gaa cac cac ttg ttc cca cgc atg age 17014 
Gly Gly Leu'Asn Phe Gin Val Glu His His Leu Phe Pro Arg Met Ser 
1210 1215 1220 1225 

age get tgg tat. ccc tac att gee ccc aag gtc cgc gaa att tgc gec 17062 
Ser Ala Trp Tyr Pro Tyr lie Ala Pro Lys Val Arg Glu lie Cys Ala 
1230 1235 1240 

aaa cac ggc gtc cac tac gec tac tac ccg tgg ate cac caa aac ttt 17110 
Lys His Gly Val His Tyr Ala Tyr Tyr Pro Trp lie His Gin Asn Phe 
1245 1250 1255 

etc tec acc gtc cgc tac atg cac gcg gec ggg acc ggt gec aac tgg 17158 
Leu Ser Thr Val Arg Tyr Met His Ala Ala Gly Thr Gly Ala Asn Trp \ 
1260 1265 1270 

cgc rcag atg gec aga v gaa aat-ccc ttg- acc gga egg- gcg taa ; •'-.*■ " ^ 17200 
Arg Gin Met Ala Arg Glu Asn -Pro. Leu Thr Gly Arg Ala . -W* 
1275 1280 1285 

agatctgecg gcatcgatcc egggecatgg ectgetttaa tgagatatgc gagacgecta 17260 

tgatcgcatg atatttgett tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 17320 



tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 17 380 
cgtattttta tgaataatat tctccgttca atttactgat tgtcegtega cgagctcggc 17440 
gcgcctctag aggatcgatg aattcagatc ggctgagtgg ctccttcaac gttgcggttc 17500 
tgtcagttcc aaacgraaaa cggcttgtcc cgcgtcatcg gegggggtea taacgtgact 17560 
cccttaattc tccgctcatg atcagattgt cgtttcccgc cttcagttta aactatcagt 17620 
gtttgacagg atatattggc ggg-taaacct aagagaaaag agegtttatt agaataatcg 17680 
gatatttaaa agggcgtgaa aaggtttatc cttcgtccat ttgtatgtgc atgccaacca 17740 
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cagggttccc ca 17752 



<210> 47 
<211> 290 
<212> PRT 
<213> Unknown 

<400> 47 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
15 10 15 

Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 

Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 40 45 

Val Leu Gly Val Ser Val Tyr Leu Thr lie Val He Gly Gly Leu Leu 
50 55 60 

Trp He Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 

Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 

85 90 95 

Leu Tyr Met Cys Val Gly He Ala Tyr Gin Ala lie Thr Trp Arg Tyr 
100 105 110 

Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala He 
115 120 125 

Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

Val He Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 
145 150 155 160 



Val Tyr His His Ser Ser He Ser Leu He Trp Trp Ala He Ala His 
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165 170 175 

His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn . Ser Gly 
180 185 190 

Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 

Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
22$ 230 235 . . 240 

Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 ' 250 255 

Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 

. .. . . . : -. 260 ^ - ~ - 265 .- . ->-\~ -V . : . . -V\* 2 70 

Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 

Thr Glu 
290 



<210> 48 
<211> 525 
<212> PRT 
<213> Unknown 

<400> 48 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
15 10 15 

lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser He Gin 
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35 40 45 

Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg val Ser Glu Ser Ala Ala 
50 55 60 

Val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 

85 90 95 

Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 HO 

His Asn Lys Pro Ser Asp Cys Trp He Val val Lys Asn Lys Val Tyr 
H5 120 125 

Asp val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 

Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 I 55 I 60 

Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr He Gly Asp Val Glu 
165 170 175 

Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 

Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 

He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 25S 
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Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

Tyr Gin Pro He Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 

Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg He 
325 330 335 

Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 

355 360 : 365 - 

Ser Pro val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 

Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 



Asn He Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin He Glu 
450 455 460 
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His His Leu Phe 
465 

Pro Arg Val Glu 



Val Ser He Ala 
500 

Val Ala Glu Ala 
515 



Pro Thr Met Pro 
470 

Val Phe Cys Lys 
485 

Thr Gly Thr Cys 



Ala Ala Glu Gin 
520 



204 

Arg His Asn Leu 
475 

Lys His Gly Leu 
490 

Lys Val Leu Lys 
505 

His Ala Thr Thr 



Asn Lys He Ala 
480 

Val Tyr Glu Asp 
495 

Ala Leu Lys Glu 
510 

Ser 
525 



<210> 49 
<211> 469 
<212> PRT 
<213> Unknown 

<400> 49 

Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr Ala Val 
1 5 10 15 

Ala Lys His Asn Ala Ala Thr He Ser Thr Gin Glu Arg Leu Cys Ser 
20 .25 30 

Leu Ser Ser Leu Lys Gly Glu Glu Val Cys lie Asp Gly He He Tyr 
35 40 45 

Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr lie Lys Met Phe 
50 55 60 

Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met He His Pro Tyr His 
65 70 75 80 

Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 

85 90 95 



Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu He Lys 
100 105 110 
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Arg Glu Val Phe Lys lie Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 
115 120 125 

Gly Trp Phe Phe Arg Ala Phe Cys Tyr lie Ala lie Phe Phe Tyr Leu 
130 135 140 

Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
145 150 155 160 

Tyr Gly lie Ser Gin Ala Met lie Gly Met Asn Val Gin His Asp Ala 
165 170 175 

Ash His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 

Leu Gly Ala Asp Phe lie Gly Gly Ser Lys Trp Leu Trp Gin Glu Gin 
195 . 200 205 

His Trp Thr His His Ala Tyr-Thr Asn His Ala Glu Met Asp Pro Asp 

210 215 f220 .. . 

Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 
225 230 235 240 

His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 

Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin He 
260 265 270 

Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly He Arg Leu Asp 
275 280 285 

Asn Ala Phe lie His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
290 295 300 

Val Tyr lie Ala Val Asn Val lie Ala Pro Phe Tyr Thr Asn Ser Gly 
305 310 315 320 

Leu Glu Trp Ser Trp Arg Val Phe Gly Asn He Met Leu Met Gly Val 
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Ala Glu Ser Leu 
340 

Glu Ser Ala Asp 
355 

Pro Val Asp Trp 
370 

Gly Phe Leu Ser 
385 

His His Leu Phe 



Pro Lys Val Arg 
420 

Tyr Pro Trp lie 
435 

Ala Ala Gly Thr 
450 

Leu Thr Gly Arg 
465 



325 

Ala Leu Ala Val 



Arg Asp Pro Thr 
360 

Phe Lys Thr Gin 
375 

Gly Cys Phe Thr 
390 

Pro Arg Met Ser 
405 

Glu lie Cys Ala 



His Gin Asn Phe 
440 

Gly Ala Asn Trp 
455 

Ala 



206 

330 

Leu Phe Ser Leu 
345 

Ala Pro Leu Lys 



Val Glu Thr Ser 
380 

Gly Gly Leu Asn 
395 

Ser Ala Trp Tyr 
410 

Lys His Gly Val 
425 

Leu Ser Thr Val 



Arg Gin Met Ala 
460 



335 

Ser His Asn Phe 
350 

Lys Thr Gly Glu 
365 

Cys Thr Tyr Gly 



Phe Gin Val Glu 
400 

Pro Tyr lie Ala 
415 

His Tyr Ala Tyr 
430 

Arg Tyr Met His 
445 

Arg Glu Asn Pro 



<210> 50 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker 



<400> 50 

gaattcggcg cgccgagctc ctcgag 
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<210> 51 
<211> 265 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polylinker-terminator-polylinker 
<400> 51 

ccaccgcgg-t gggcggccgc ctgcagtcta gaaggcctcc tgctttaatg agatatgcga 60 
gacgcctatg atcgcatgat atttgctttc aattctgttg tgcacgttgt aaaaaacctg 120 
agcatgtgta gctcagatcc ttaccgccgg tttcggttca ttctaatgaa tatatcaccc 18 0 
gttactatcg tatttttatg aataatattc tccgttcaat ttactgattg tccgtcgacg 2 40 
aattcgagct cggcgcgcca agctt — 2 65 

<210> 52 
<211> 257 
<212> DNA 

<213> Artificial sequence 
<220> 

<2 2 3> Polylinker-terminator-poly linker 
<400> 52 

ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc gagacgccta 60 
tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120 
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 
cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag ,2 40 



ctcggcgcgc caagctt 



257 
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<210> 53 
<211> 257 
<212> DHA 

<213> Artificial sequence 
<220> 

<223> Polylinker-terminator-polylinker 
<400> 53 

agatctgccg gcatcgatcc cgggccatgg cctgctttaa tgagatatgc gagacgccta 60 
tgatcgcatg at-atttgctt -tcaattctgt, t-gtgcacgtt, gtaaaaaacc tgagcatgtg 120 
tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 
cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag 240 



ctcggcgcgc caagctt 



257 



